Phenotype generation from EMR by tensor factorization SEDI Durham Cohort James Lu M.D. Ph.D. Department of Electrical and Computer Engineering Department.

Slides:



Advertisements
Similar presentations
Copyright © 2014 American Medical Group Foundation. All rights reserved. August 12, 2014 Initial Feedback Report: Hypertension Prevalence and Control.
Advertisements

© HealthFitness 2012 Integrated Health Coaching: The Next Generation in Health Behavior Change Management Dennis Richling, Chief Medical and Wellness Officer.
Preventable Hospitalizations: Assessing Access and the Performance of Local Safety Net Presented by Yu Fang (Frances) Lee Feb. 9 th, 2007.
The Research Question Alka M. Kanaya, MD Associate Professor of Medicine, Epidemiology & Biostatistics UCSF October 3, 2011.
Simulation of “forwards-backwards” multiple imputation technique in a longitudinal, clinical dataset Catherine Welch 1, Irene Petersen 1, James Carpenter.
Quality Improvement Research Methods: Issues in Detecting Changes in Clinician Performance Haya R. Rubin, M.D., Ph.D. Lynne Nemeth, R.N., Ph.D. Hoangmai.
Area 4 SHARP Face-to-Face Conference Phenotyping Team – Centerphase Project Assessing the Value of Phenotyping Algorithms June 30, 2011.
CVD prevention & management: a new approach for primary care Rod Jackson School of Population Health University of Auckland New Zealand.
Department of Health and Human Services Measuring Clinical Lab Ordering Quality: Theory and Practice Steven M. Asch MD MPH VA, RAND, UCLA April 29, 2005.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2009.
University of Pittsburgh Department of Biomedical Informatics Healthcare institutions have established local clinical data research repositories to enable.
RACIAL DISPARITIES IN PRESCRIPTION DRUG UTILIZATION AN ANALYSIS OF BETA-BLOCKER AND STATIN USE FOLLOWING HOSPITALIZATION FOR ACUTE MYOCARDIAL INFARCTION.
Study: Statins increase life expectancy Detroit News, Associated Press
Journal Club Alcohol, Other Drugs, and Health: Current Evidence May-June 2008.
Azara Proprietary & Confidential Controlling High Blood Pressure 2014 Measure Changes Improving Patient Outcomes through Data.
Cohort Studies Hanna E. Bloomfield, MD, MPH Professor of Medicine Associate Chief of Staff, Research Minneapolis VA Medical Center.
INTRODUCTION TO ICD-9-CM
Overly concerning and falsely reassuring?? FRAMINGHAM RISK FACTORS IN THE ED.
Dr K N Prasad MD., DNB Community Medicine
Opportunities for Big Data in Medical Records Mike Conlon, PhD
The Economic Impact of Intensive Case Management on Costly Uninsured Patients in Emergency Departments: An Evaluation of New Mexico’s Care One Program.
Elizabeth Karlson, MD Associate Professor of Medicine
EHRS as a Tool to Improve BP Control 1.Brief history of OQIUN, CCI. Began 1999 using data cards. Started working with multiple practice sites using different.
Source: Site Name and Year IHS Diabetes Audit Diabetes Health Status Report ______Site Name_________ Health Outcomes and Care Given to Patients with Diabetes.
... a KAISER PERMANENTE Innovation IndiGO: Tailoring Guidelines to Individuals David M Eddy MD PhD Founder and Chief Medical Officer Emeritus Archimedes.
Risk Adjustment Data For Business Insight Health Care Service Corporation September 2012.
1 Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2014.
William B. Kannel, MD, FACC Former Director, Framingham Heart Study
1 NHLBI/NEI National Institutes of Health NHLBI/NEI National Institutes of Health.
Universal Adoption of the EHR What is Meaningful Use and why should it be important to me?
Improving Hypertension Quality Measurement Using Electronic Health Records S Persell, AN Kho, JA Thompson, DW Baker Feinberg School of Medicine Northwestern.
CAS Seminar on Health and Managed Care Benchmarking Measures and Quality Control October 18-19, 1999 page CAS Seminar on Health and Managed Care.
Dr K N Prasad Community Medicine
Mrs. Watcharasa Pitug ID The Association between Waist Circumference and Renal Insufficiency among Hypertensive Patients 15/10/58 1.
U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES Centers for Disease Control and Prevention National Center for Health Statistics 1 Monitoring Million Hearts.
Office of Statewide Health Planning and Development Day for Night: Hospital Admissions for Day Surgery Patients in California, 2005 Mary Tran, PhD, MPH.
BlueCross BlueShield of Tennessee, Inc., an Independent Licensee of the BlueCross BlueShield Association. This document has been classified as public Information.
Referral Request and Referral Report Connie Sixta, PhD.
A Claims Database Approach to Evaluating Cardiovascular Safety of ADHD Medications A. J. Allen, M.D., Ph.D. Child Psychiatrist, Pharmacologist Global Medical.
MHSPHP Metrics Forum July 2013
Clinical Health Indicator Improvements and Hospital Usage Report Health Integration Project December 2013 Matthew Rich Matthew Rich – Health Integration.
Improving Care Coordination and Readmissions Using Real Time Predictive Analytics from an HIE New Jersey / Delaware Valley HIMSS Conference Atlantic City,
Long-term Cardiovascular Effects of 4.9 Years of Intensive Blood Pressure Control in Type 2 Diabetes Mellitus: The Action to Control Cardiovascular Risk.
Ross T. Tsuyuki, BSc(Pharm), PharmD, MSc, FCSHP, FACC Yazid NJ Al Hamarneh, BPharm, PhD Charlotte Jones, MD, PhD, FRCP(C) Brenda Hemmelgarn, MD, PhD, FRCP(C)
Uses of the NIH Collaboratory Distributed Research Network Jeffrey Brown, PhD for the DRN Team Harvard Pilgrim Health Care Institute and Harvard Medical.
Vitamin D: A New Frontier in Diabetes Management Contact Information: Background Acknowledgement Methods.
Carina Signori, DO Journal Club August 2010 Macdonald, M. et al. Diabetes Care; Jun 2010; 33,
Department of Public Health Medically Indigent Care Reporting System (MICRS) Medically Indigent Services Program (MISP) 1.
Estimating Absolute Risk Reductions Associated with Interventions in Patients with Type 2 Diabetes Jim Mold, M.D., M.P.H. Brian Firestone, MS2.
A ssociation of Public Health Observatories Hospital Activity data Roy Maxwell SWPHO & Bristol University Dr Richard Wilson Sandwell PCT.
GO! Diabetes Train the Trainer Program. Practice Performance and Improvement.
Hashem B. El-Serag,1,2 Eric A. Engels,3 Ola Landgren,3 Elizabeth Chiao,1,2 Louise Henderson,1,2 Harshinie C. Amaratunge,1,2 and Thomas P. Giordano1,2 R1.
Dr John Cox Diabetes in Primary Care Conference Cork
High-Throughput Machine Learning from EHR Data
Referral Request and Referral Report
C Wilson, KM Rhodes, RA Payne
Diagnosing Diabetes and Predicting Complications
Group 7 Hospital Readmission Predictive Analytics
. Troponin limit of detection plus cardiac risk stratification scores for the exclusion of myocardial infarction and 30-day adverse cardiac events in ED.
The Importance of Adequately Powered Studies
AN OVERVIEW OF JOHN HOPKINS ACG-PREDICTIVE MODELING TOOL AND UNDERSTANDING RESOURCE UTILIZATION BANDS (RUBs)
Mortality Prognostic Model for Peripheral Arterial Disease
Vanguard Phase Results for the Blood Pressure Component
Alina M. Allen MD, Patrick S. Kamath MD, Joseph J. Larson,
Development and Validation of HealthImpactTM: An Incident Diabetes Prediction Model Based on Administrative Data Rozalina G. McCoy, M.D.1, Vijay S. Nori,
IBH, Cost (Risk Adjusted)
National STD Conference 2008
Time to CARE: A collaborative engine for practical disease prediction
Predicting Pneumonia & MRSA in Hospital Patients
Issues With Synthea Generated Patient Data for Use in Clinical Quality Measure – Hypertension Case Study Peter Li, OSEHRA 1/24/2019.
Presentation transcript:

Phenotype generation from EMR by tensor factorization SEDI Durham Cohort James Lu M.D. Ph.D. Department of Electrical and Computer Engineering Department of Medicine

3.2 Trillion / yr (~21% of GDP) Health System Under Pressure

Small Molecules, Medical Devices, Biologics, diagnostics, genomics, transcriptomics…. OperationsNovel technology Align incentives, risk sharing, quality metrics, reducing readmissions, six sigma/ lean, … Where do I achieve cost arbitrage? How do we identify which patients to study? Where is my patient going to do next? Can we reorganize patient flow?

Computable phenotypes are a top down process PheKB, Northwestern

Many variations of computable phenotypes require adjudication by physicians. Richesson, et al Expensive and time consuming

EMR Data is large and Complicated Durham County, Patient level  >240,000 patients  Birthday  Death (where available)  Gender  Race  Ethnicity Visit level  4.4 Million patient visits  Average 18 measurements recorded per visit  Indicator of presence/absence of particular diseases (computed)  Encounter date (start, end)  Location (DHRH, DUH, DRH)  Path (ED -> inpatient for example)  Inpatient / Outpatient > 60,000 types of observations CPT ICD9 diagnoses ICD9 procedures Lab values Medications Vitals Intervention level Caveats: Temporal gaps – People are only patients when they are sick We want to incorporate all of this information Don’t want to be fooled by mistakes and bias

Decompose each touch with the health care system into its parts ● Each visit is a 5-D tensor (~1 billion elements) ● Patient ● Diagnosis/ Billing Codes ● Labs ● Medications ● Time ● Model as Counts ● Decompose into set of K rank 1 vectors With Piyush Rai and Changwei Hui Codes Labs Medications Time

Computational phenotypes are a bottom-up process. Factors represent latent phenotypes Evaluate pts with ~23MM data-points with morbidity outcomes in diabetes Alprazolam Urate Factor 2 Factor 10 Malignant Neoplasm Prostate Clinical Trial Participation Secondary Malignant Neoplasms of Bone External Catheter Set CEA AG 15-3 Allopurinol Evening Primrose Oil Systemic Lupus Erythematosus Side Effects from Statins Shoulder Pain Calcidiol Jo-1

Patients are composites of common and rare latent phenotypes. ER/ EKG Standard Labs (i.e. CBC/ BMP) Kidney Disease Hypertension Surgical Patient Patient by Factor Score Matrix, 40 most common phenotypes

Compare Outcome prediction to Known Algorithm (UKPDS)  UKPDS: UK Prospective Diabetes Study outcomes model used to predict MI, Death, and Stroke  7 demographic + lab variables:  age, ethnicity, smoking status  A1c, HDL, Total Cholesterol and Systolic BP  Dataset  Original 7 variable model  All Data  Non Matrix Factorization  Tensor Factorization  Can we predict outcome in next year  Death  AMI  Stroke  Classification Model:  Fit data with Random Forests  10 fold cross validation With Joseph Lucas

Tensor derived factors performs better than original UKPDS in all outcomes, provides comparable performance to “all-data” model Stroke is similar to Dat