Predicting Pneumonia & MRSA in Hospital Patients

Slides:



Advertisements
Similar presentations
Predicting Risk of Re-hospitalization for Congestive Heart Failure Patients (in collaboration with ) Jayshree Agarwal Senjuti Basu Roy, Ankur Teredesai,
Advertisements

Preventable Hospitalizations: Assessing Access and the Performance of Local Safety Net Presented by Yu Fang (Frances) Lee Feb. 9 th, 2007.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
University of Minho School of Engineering Algoritmi Centre Uma Escola a Reinventar o Futuro – Semana da Escola de Engenharia - 24 a 27 de Outubro de 2011.
John Wieler Management Information Systems In a Healthcare Setting.
Risk management planning related to Health Information Technology
Component 1: Introduction to Health Care and Public Health in the U.S. 1.5: Unit 5: Financing Health Care (Part 2) 1.5d: Controlling Medical Expenses.
Algorithmic Approaches To Personalized Health Care Principal Investigators: I. Paschalidis and W. Adams at Boston Univ., collaborative with D. Bertsimas.
Solution Overview for NIPDEC- CDAP July 15, 2005.
Harnessing the Power of Predictive Modeling Future Trends.
Jennifer Lewis Priestley Presentation of “Assessment of Evaluation Methods for Prediction and Classification of Consumer Risk in the Credit Industry” co-authored.
Focus Area 17: Medical Product Safety Progress Review November 5, 2003.
Unit 4 Seminar MO 250 Medical Records Management: Electronic Health Records.
Decision Analytic Approaches for Evidence-Based Practice M8120 Fall 2001 Suzanne Bakken, RN, DNSc, FAAN School of Nursing & Department of Medical Informatics.
Phenotype generation from EMR by tensor factorization SEDI Durham Cohort James Lu M.D. Ph.D. Department of Electrical and Computer Engineering Department.
Special Challenges With Large Data Mining Projects CAS PREDICTIVE MODELING SEMINAR Beth Fitzgerald ISO October 2006.
Developing outcome prediction models for acute intracerebral hemorrhage patients: evaluation of a Support Vector Machine based method A. Jakab 1, L. Lánczi.
Jan 2002 EDMA The central role of the Medical Laboratory in a World of Managed Health An EDMA presentation of the benefits of in vitro testing as a basis.
Pharmacy in Public Health: Describing Populations Course, date, etc. info.
Quality Measurement A Changing Landscape
Wireless Access SSID: cwag2017
EDUCAUSE Annual Conference
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
The Peer Review Higher Weighted Diagnosis-Related Groups
Parallel Sessions: Pathways & Prediction
Logistic Regression APKC – STATS AFAC (2016).
Presented by Khawar Shakeel
An Empirical Comparison of Supervised Learning Algorithms
Group 7 Hospital Readmission Predictive Analytics
Clinical Data Exchange – Report Card
Exercise Adherence in Patients with Diabetes: Evaluating the role of psychosocial factors in managing diabetes Natalie N. Young,1, 2 Jennifer P. Friedberg,1,
Re-engineering Cardiac Catheterization at Cornell Medical Center
CCDA vs. QRDA1. CCDA vs. QRDA1 Claimed diagnoses, procedures, medications Patient Out of Pocket Take 3 diabetes measures: 1) Appropriate Testing,
Samantha Ketchin, Clinical Analyst Auckland DHB
Financial Analysis Of Electronic Health Records (EHR’s)
Presenter: Christi Melendez, RN, CPHQ
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Evaluating Sepsis Guidelines and Patient Outcomes
About the Client Challenges
NBA Draft Prediction BIT 5534 May 2nd 2018
ACO Population Health: Raising the Bar Along the Journey
The Nursing Process and Pharmacology Jeanelle F. Jimenez RN, BSN, CCRN
Fenglong Ma1, Jing Gao1, Qiuling Suo1
Medication Adherence Solution
Comparing automated mental health screening to manual processes in a health care system Josh biber.
An Inteligent System to Diabetes Prediction
PICO Power Point Presentation
MEDICAL CERTIFICATION OF Cause of death THE ROLE OF THE REVIEW COMMITTEE Samoa 2017.
Should They Stay or Should They Go
iSRD Spam Review Detection with Imbalanced Data Distributions
Current national average Impact on number of people
Chapter 33 Acute Care.
Multivariable Logistic Regression Split Cohort into Development &
Selecting the Right Predictors
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Optum’s Role in Mycare Ohio
Healthcare Business challenge
Scalable and accurate deep learning with electronic health
Current national average Impact on number of people
Model Enhanced Classification of Serious Adverse Events
Component 1: Introduction to Health Care and Public Health in the U.S.
Identifying Severe Weather Radar Characteristics
Using Your EMR for More than Just Documenting
Predicting Loan Defaults
Simplifying Healthcare
The Student’s Guide to Apache Spark
Machine learning analysis for predicting survival in stage III non-small cell lung cancer patients receiving definitive chemotherapy and proton radiation.
Credit Card Fraudulent Transaction Detection
REACHnet: Research Action for Health Network
Presentation transcript:

Predicting Pneumonia & MRSA in Hospital Patients Team Lavanya Basava Raju Ryan Borowicz Xiurong Lin Ananya Mishra Predictive Analytics MSBA 6420 12/8/16

Agenda Project Background & Goals Business Understanding Data Understanding & Data Preparation Modeling & Evaluation Deployment Conclusions & Next Steps

1.7 Million Hospital-Acquired Infections (HAI) Cause or Contribute to up to 99,000 Deaths Each Year Causes: Health care staff Contaminated equipment Bed linens Air droplets Other patients Costs: Tangible: Average of $30,000 per patient - Hospital not reimbursed Cost of preventative treatment under $500 Intangible: Patient welfare, hospital reputation and compliance issues Business Understanding: 1) Hospital Impact – Tangible (costs of patient care for re-admittance) & intangible (reputation, compliance reqs) 2) Patient Impact – sickness, loss of trust in healthcare system leads to avoiding future preventative visits 3) Additional impacts on other hospital staff members, pharmacists, etc. 4) Data mining solution will try to take proactive preventative approach to addressing problem as opposed to current reactive plan which is expensive to the hospital and bad for all parties.

Use Predictive Modeling to Lower HealthCare Provider Costs and Improve Patient Outcomes Informs Data Mining Contest 2008 Previous Research “Predicting Hospital-Acquired Infections by Scoring System with Simple Parameters” “Comparative impact of hospital-acquired infections on medical costs, length of hospital stay and outcome between community hospitals and medical centers” Goal – Build a predictive model to classify hospital patients according to their likelihood of obtaining a hospital acquired infection (HAI) of pneumonia or MRSA 1) Predicting HAI’s 2) Informs data mining contest – 2006 - Original intention was to predict likelihood of HAI and patient re-admittance within 30 days – later changed to predicting pneumonia and coming up with cost model because the data did not have accurate target variables or associated rules for determining which records were - broke contest into two portions 3) Related Research Goal: Use Cost-Sensitive Predictive Modeling to help hospitals proactively address HAI’s

Data Understanding & Preparation Consumed the Majority of Time on the Project Demographics Hospital Events Patient Conditions Data Understanding & Preparation: 1) 4 datasets – conditions, demographics, hospital visits, medications 2) Unclear target variable – ended up using all icd codes with V09 (msra) or between 480 & 486 (pneumonia) 3) Large number of missing values, incorrect data, lots of assumptions needed (with no answers available), highly time consuming data prep Medications

Lack of Integrated Data Created Challenges in Merging the Datasets for Modeling Patients between Hospital, Conditions, Medications and year not linked Data Understanding & Preparation (Cont.): 1) Issues in joining the different datasets – example of patient having 4 icd codes in conditions table, 1 of them in hospitals data, and 1 other in rx data 2) Led to having to create temporary tables for hospitals & medications datasets and using the variables from those as predictor variables – could lead to potential information loss 3) Final dataset includes large number of dummy variables – did this create any issues? Created data at patient level

Seven Predictive Modeling Techniques Were Used to Classify Patients According to their Likelihood for Infection Pre-processing Modeling Techniques Used Balanced Data Normalize Attribute Selection and Weighting Principal Component Analysis (PCA) Join to get data at patient level Ensemble Decision Tree K-NN Logistic Regression Naïve Bayes Neural Net SVM Modeling: Used R for Data Preparation and RapidMiner for Modeling 2) Discuss classification task and approach to solving 3) Discuss different models tried, parameters attempted, iteration process, etc.

Logistic Regression and SVM Provide the Best Cost-Sensitive Classification Models   Accuracy % Recall AUC F-measure Misclassification Cost Fales Positives - Non-Infected Patients Requiring Preventive Measure Decision Tree 42.62 100 0.824 53.19 442.004 83 % K-NN 34.55 0.772 48.72 482.372 95 % Logistic Regression 62.34 0.882 64.23 343.400 53 % Naïve Bayes 0.846 55.50 411.502 74 % Neural Networks 55.65 0.852 59.16 376.859 64 % SVM 63.53 0.877 64.50 337.48 55 % Marital status     Age Race No of Procedures Type of Procedures Older the person higher is the likelihood of his getting the condition. A single person is more prone to getting the condition. White people are at higher risk for this condition. Higher the number of procedures, higher is the chance of getting the condition. Type of Conditions Type of Pharmacy

Hospital savings by using the predictive models Current Costs Costs with Predictive Models Savings Non-Infected Patients Requiring Preventive Measure Logistic Regression $8,576,761 $4,900,261 42.8% 53% Support Vector Machine $5,010,261 41.6% 55% Preventive Measure $ 500 per patient Cost of Treating the Infected Patient $ 29,473 per patient

Deployment in Production Will Require a Cross-Functional Team Approach IT Department Build data pipeline to automate end-to-end processing Physicians & Nurses Use model outcomes in preventative treatment approach Data/Business Analysts Continuous evaluation and improvement of model in Production Patient Intake Staff Ensure data quality of patient responses Deployment: Day-to-day business process (discuss issue of leakage – not all variables we used in model will be available for new patients – i.e. hospital charges, diagnosis code, prescription information , etc.) Changes to IT infrastructure, data collection processes, hospital operations around intake process and patient records Potential legal/compliance issues – what permission, if any, needs to be given by patient for taking the preventive medicine? What are the impacts/potential side effects of

Improvements in Data Management Will Improve Model Outcomes Integrated Electronic Health Record (EHR) system Improved data collection and review procedures Incorporate information from medical devices Build additional health and lifestyle metrics into the model

Questions