Data Science + the EHR Noémie Elhadad

Slides:



Advertisements
Similar presentations
Design and Implementation of a Web-Based Patient Portal Linked to an Ambulatory Care Electronic Health Record: Patient Gateway for Diabetes Collaborative.
Advertisements

Consolidation Communicable Diseases User Stories: Meeting Agenda 1.News from other domains 2.Recap of a previous meeting 3.Consolidation of three more.
CDM Registry Project Dr. Richard Lewanczuk Regional Medical Director Chronic Disease Management Capital Health.
Computerization of the practice Grzegorz Margas, M.D., Ph.D. Department of Family Medicine Jagiellonian University Medical College.
Text Mining of Medical Documents Michael Elhadad - Raphael Cohen Dept of Computer Science.
1 Data-driven Approaches to Augment Clinical Decision in EMR Era Health Data Initiative Forum III The Health Datapalooza Ketan Mane Renaissance Computing.
Overview of Longitudinal Coordination of Care (LCC) Presentation to HIT Steering Committee May 24, 2012.
Components of EHR Farrokh Alemi, Ph.D.. Definitions Electronic Medical Record Electronic Medical Record Electronic Health Record Electronic Health Record.
Proposed Meaningful Use Criteria for Stage 2 and 3 John D. Halamka.
ETIM-1 CSE 5810 CSE5810: Intro to Biomedical Informatics Mobile Computing to Impact Patient Health and Data Exchange and Statistical Analysis Presenter:
Use of clinical laboratory databases to enable early identification of patients at highest risk of developing end- stage kidney disease Dr David Kennedy.
Vision of how informatics enables a transformed health system Joyce Sensmeier MS, RN-BC, CPHIMS, FHIMSS, FAAN Vice President, Informatics, HIMSS President,
Chapter 2 Electronic Health Records
Medicare & Medicaid EHR Incentive Programs
Medical informatics Lecture 1
Implementation of Enterprise Wide Speech Recognition, Text-based Documentation and Automated Document Distribution May 27, 2013 Michelle Leafloor.
Health Information Technology: EHRs and Beyond Thomas Sequist, MD MPH.
Decision Support for Quality Improvement
E-Referral enabled collaborative health care Opportunities and considerations Presented by: Sasha Bojicic Emerging Technology Group Canada Health Infoway.
Component 10 – Fundamentals of Workflow Process Analysis and Redesign
Quality Improvement HIT Design to Support Teamwork and Communication Lecture a This material (Comp12_Unit7a) was developed by Johns Hopkins University,
*To Err is Human: Building a Safer Health System. National Academy Press, 2001 Why is DynaMed Needed? Between 44,000 and 98,000 American deaths per year.
My Own Health Report: Case Study for Pragmatic Research Marcia Ory Texas A&M Health Science Center Presentation at: CPRRN Annual Grantee Meeting October.
The Electronic Medical Record: A Tool for Teaching Nancy B. Clark, M.Ed. Director of Medical Informatics Education, FSU College of Medicine For GRIPE Annual.
Working with Health IT Systems Potential Issues with Adoption and Installation of an HIT System This material (Comp7_Unit9) was developed by Johns Hopkins.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
National Efforts for Clinical Decision Support (CDS) Erik Pupo Deloitte Consulting.
Interoperability Showcase In collaboration with IHE Use Case 3 Care Theme: Leveraging National Healthcare Registries in Care Delivery Biosurveillance Monitoring.
ALBERTA HEALTH SERVICES ITAC Alberta eHealth Update October 2009.
The Ottawa Hospital Clinical Category: e-Reports of Services ImagineNation Challenge Clinical Document Manager Electronic Discharge Summary.
Copyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. McGraw-Hill Chapter 7 Introduction to Practice Partner Electronic Health Records.
Applying Science to Transform Lives TREATMENT RESEARCH INSTITUTE TRI science addiction Mady Chalk, Ph.D Treatment Research Institute CADPAAC Conference.
The Status of Health IT in British Columbia Elaine McKnight.
Unit 5a: Care Coordination HIT Design for Teamwork and Communication This material was developed by Johns Hopkins University, funded by the Department.
Provider Data Migration and Patient Portability NwHIN Power Team August 28, /28/141.
Using the Electronic Health Record to Encourage Evidence-Based Practice Jonathan S. Einbinder, MD, MPH Partners HealthCare
1 The Effect of Primary Health Care Orientation on Chronic Illness Care Management Julie Schmittdiel, Ph.D., Stephen M. Shortell, Ph.D., Thomas Rundall,
Physicians and Health Information Exchange (HIE) The Value of HIE to a Physician’s Practice and Consumers.
Health Management Information Systems Unit 3 Electronic Health Records Component 6/Unit31 Health IT Workforce Curriculum Version 1.0/Fall 2010.
This material was developed by Oregon Health & Science University, funded by the Department of Health and Human Services, Office of the National Coordinator.
Validation and Refinement of a Prediction Rule to Identify Children at Low Risk for Acute Appendicitis Kharbanda AB, Dudley NC, Bajaj L, et al; Pediatric.
Mount Auburn Practice Improvement Program (MA-PIP)
dWise Healthcare Bangalore
Improving Care Coordination and Readmissions Using Real Time Predictive Analytics from an HIE New Jersey / Delaware Valley HIMSS Conference Atlantic City,
Fundamentals of Workflow Analysis and Process Redesign
Unit 7a: Workflow Assessment Safe Workflow Design This material was developed by Johns Hopkins University, funded by the Department of Health and Human.
The Medical Record, Documentation, and Filing
Canadian Best Practice Recommendations for Stroke Care Recommendation 1: Public Awareness and Patient Education (Updated 2008)
History of Health Information Technology in the U.S. History of Electronic Health Records (EHRs) Lecture a – Early EHR Prototypes This material Comp5_Unit6.
N VISUAL ANALYTICS FOR HEALTHCARE: BIG DATA, BIG DECISIONS David Gotz Healthcare Analytics Research Group IBM T.J. Watson Research Center.
Bronx Health Access: IT Requirements Gathering IT REQUIREMENTS GATHERING 1.
Chapter 1 Introduction to Electronic Health Records Copyright © 2011 by Saunders, an imprint of Elsevier Inc.
Health Management Information Systems Unit 3 Electronic Health Records Component 6/Unit31 Health IT Workforce Curriculum Version 1.0/Fall 2010.
History of Health Information Technology in the U.S. The HITECH Act Lecture b – Meaningful Use, Health Information Exchange and Research This material.
Tim Friede Department of Medical Statistics
Medical Records.
Electronic Medical and Dental Record Integration Options
Patient Centered Medical Home
Electronic Health Records (EHR)
Jessica Lobban, PGY-3 CCLP Family Medicine Residency Program
NYSDOH AIDS Institute Quality of Care Program eHIVQUAL
Leveraging Payer Data to Jumpstart RHIOs
Comparing automated mental health screening to manual processes in a health care system Josh biber.
Electronic Health Records
Component 11 Unit 7: Building Order Sets
Risk Stratification for Care Management
Text Mining of Medical Documents
Presentation transcript:

Data Science + the EHR Noémie Elhadad

software/d/d-id/ ? mess/2013/09/27/651a81f e3-b75d-5b7f _story.html disaster disruptive-and-here-to-stay/ health-records dissatisfied-ehr-systems

Today we’ll be talking about What’s the story of the patient I am taking care of? What is my patient at risk for? Information overload Disease Progression Prediction Summarization of patient record – Information needs – Summarization strategy – Deployment

Information overload Present at all levels of care – Primary / inpatient / emergency care – Health information exchange EHR data is cognitively taxing to navigate – Lots of it – Heterogeneous data – Primarily organized chronologically -McDonald (1976) Protocol-based computer reminders, the quality of care and the non-perfectability of man. N Engl J Med. -Christensen & Grimsmo (2008) Instant availability of patient records, but diminished availability of patient information: a multi- method study of GP’s use of electronic patient records. BMC Med Inform Decis Mak. -Chase et al (2009). Voice capture of medical residents’ clinical information needs during an inpatient rotation. J Am Med Inform Assoc. -Schapiro et al (2006) Approaches to patient health information exchange and their impact on emergency medicine. Ann Emerg Med. -Adler-Milstein et al (2011) A survey of health information exchange organizations in the United States: implications for meaningful use. Ann of Intern Med. -Stead & Lin (2009) Computational technology for effective health care: immediate steps and strategic directions. National Research Council of the National Academies. -Singh et al (2013) Information overload and missed test results in electronic health record-based settings. JAMA Intern Med.

What’s my patient at risk for? Disease progression prediction – Chronic kidney disease State of the Art for risk prediction models for CKD – Varying model type (Logistic, Cox) – Varying features (demographics, eGFR, diagnoses, laboratory tests) – Varying outcomes (creatinine, eGFR, complications, kidney failure)

Our goal Use longitudinal, heterogeneous data sources to predict risk of a near-term CKD outcome that should be sensitive to short-term medical decisions. In contrast to previous studies, we: – Use EHR data – Use longitudinal data (up to 20 years back) – Use heterogeneous data (demographics, labs, notes) – Use stage III CKD as a trigger for prediction and stage IV as the outcome. Perotte et al (2015) Risk Prediction for Chronic Kidney Disease Progression Using Heterogeneous Electronic Health Record Data and Time Series Analysis..J Am Med Inform Assoc.

Data + Models Data – ~20k patients visiting primary care clinic – ~3k with stage III CKD and ~307 with stage IV CKD 5 predictive models compared – all incorporated into a basic cox – eGFR – Estimated glomerular filtration rate – RLT – Recent Laboratory tests – TKF – Text Kalman filter – LKF – Laboratory test Kalman filter – LTKF – Laboratory test and Text Kalman filter

eGFR and RLT (recent lab tests) models Stage III CKD Stage IV CKD Time Prediction

TKF (Text Kalman Filter) Stage III CKD Stage IV CKD Time Prediction

LKF (Lab Kalman Filter) Stage III CKD Stage IV CKD Time Prediction

LTKF (Lab & Test Kalman Filter) Stage III CKD Stage IV CKD Time Prediction

Methods Component models – Model of text (latent Dirichlet allocation (LDA)) K=50 – Model of the past (Kalman Filter) Discrete time, binned by month, observations included 19 laboratory values and notes represented as log transformation of topic proportions – Model of the future (Cox proportional hazards) Covariates include Kalman filter latent values at stage III onset, Kalman filter offsets, and demographics. Dependent variable is time to stage IV

Results *=p<0.05, **=p<0.01, ***=p<0.001 Δ LTKFΔ LKFΔ TKFΔ RLTΔ eGFRConcordance LTKF **** LKF *** **0.836 TKF *** RLT **0.819 eGFR 0.779

Results – risk factors Topic 3 (heart failure)Topic 32 (diabetes)Topic 29 (dialysis) lasixunitsq15 volumeinsulinDialysis edemasubcutaneousFistula heartlantusVolume failureglucoseBid worseningdiabetesLasix diuresistimesPlacement severe70/30Improved diastolicdiabeticHeparin overloaddaysExamined

Results – protective factors Topic 33 (family history) Topic 35 (health maintenance) Topic 41 (non- specific) Topic 43 (gynecological) Topic 45 (asthma) died historybreastAlbuterol ageflupressurevaginalAsthma yearsvisitratemammoInhaled motherfastingcountcancerLung fathercolonoscopythreehxobstructive brotheryearrevealedpapWheezing sistershottimesnlAdvair workedvaccineshortnessagePulm childrenwnldischargedwillrestrictive deceasedcheckcreatinineendometrialPuffs

Predictive modeling on EHR data Incorporating longitudinal information helps Incorporating types of evidence (text, labs) helps Meaningful data science + EHR: – How to make this type of predictions useful for clinicians? – How to make them useful within their workflow?

Back to information overload Present at all levels of care – Primary / inpatient / emergency care – Health information exchange EHR data is cognitively taxing to navigate – Lots of it – Heterogeneous data – Primarily organized chronologically -McDonald (1976) Protocol-based computer reminders, the quality of care and the non-perfectability of man. N Engl J Med. -Christensen & Grimsmo (2008) Instant availability of patient records, but diminished availability of patient information: a multi- method study of GP’s use of electronic patient records. BMC Med Inform Decis Mak. -Chase et al (2009). Voice capture of medical residents’ clinical information needs during an inpatient rotation. J Am Med Inform Assoc. -Schapiro et al (2006) Approaches to patient health information exchange and their impact on emergency medicine. Ann Emerg Med. -Adler-Milstein et al (2011) A survey of health information exchange organizations in the United States: implications for meaningful use. Ann of Intern Med. -Stead & Lin (2009) Computational technology for effective health care: immediate steps and strategic directions. National Research Council of the National Academies. -Singh et al (2013) Information overload and missed test results in electronic health record-based settings. JAMA Intern Med.

Patient record summarization “The act of collecting, distilling, and synthesizing patient information for the purpose of facilitating any of a wide range of clinical tasks” Previous approaches focus on specific disease focus on specific care setting (ICU) largely ignore EHR text deployment and study of impact is lacking Flebowitz et al (2011) Summarization of clinical information: a conceptual model. J Biomed Inform. Pivovarov and Elhadad (2015) Automated methods for summarization of electronic health records. J Am Med Inform.

How clinicians summarize patient information 92yo woman Reichert et al (2010) Cognitive analysis of the summarization of longitudinal patient records. AMIA Annu Symp.

How clinicians summarize patient information Average time in each section of the EHR On average, physicians spent 50% of their time in the Notes section 25% of their time in the Laboratory section Reichert et al (2010) Cognitive analysis of the summarization of longitudinal patient records. AMIA Annu Symp.

How clinicians summarize patient information Reichert et al (2010) Cognitive analysis of the summarization of longitudinal patient records. AMIA Annu Symp. All physicians visited the “Notes” section first No established ordering of summary content -Problem-oriented view of the patient 0 min 5 min 10 min 15 min 20 min 25 min 30 min

Functionality wish list for an EHR summarizer Aggregate information from the whole record But allow for zooming in and out of particular parts of the record Use notes as primary content selection source Facilitate finding supporting evidence in documentation Be problem oriented Be interactive Update in “real time”

HARVEST Extracts content from a patient’s longitudinal documentation Aggregates information from multiple care settings Visualizes content through a timeline of a patient’s problem documentation and clinical encounters Distributed computing infrastructure Deployed at NewYork-Presbyterian hospital Hirsch et al (2014) HARVEST, a longitudinal patient record summarizer. J Am Med Inform Assoc.

HARVEST

Natural language processing of clinical documentation Extract problems mentioned in all the notes of a record – Conditions, as well as signs and symptoms Compute salience of problem documentation for a given time frame in a patient record Challenges – Robust processing across all note types – Identify and merge problems that are semantically similar – Handle redundancy within longitudinal record Pivovarov & Elhadad (2012) A hybrid knowledge-based and data-driven approach to identifying semantically similar concepts. J Biomed Inform. Cohen et al (2013) Redundancy in electronic health record corpora: analysis, impact on text mining performance, and mitigation strategies. BMC Bioinform. Cohen et al (2014) Redundancy-Aware Latent Dirichlet Allocation for Patient Record Notes. PloS ONE. Hirsch et al (2014) HARVEST, a longitudinal patient record summarizer. J Am Med Inform Assoc.

Natural language processing of clinical documentation Distributed infrastructure – 650,000 notes/month avg. are authored at NYP – 20,000 notes/second parsing and indexing (compared to 500 notes/second in a non-distributed infrastructure)

Use cases “What’s the story?” – Hospital admission – Walk-in at clinic – ED visit Chart biopsy Quality indicators Researchers and trial coordinators Education “This is the cooest [sic] thing every. VERY useful to gather information when a patient arrives in the ED. If this were YELP I would given you a 5. Really.” “The word cloud in the problem list is simply the most useful EMR function I have ever come across! Being able to link word associations and problems throughout enormous medical records helps with rapid data mining and is immediately useful in the emergency setting.”

Conclusions EHR summarization – Robust NLP of underlying data – Information visualization – Computing infrastructure to enable operational summarization Virtuous circle Summarization strategies Validation & assessment Operational deployment Users & use cases Information needs

Thank you! people.dbmi.columbia.edu/noemie/harvest National Library of Medicine R01 LM National Science Foundation award #