Predicting the Outcome of Patient-Provider Communication Sequences using Recurrent Neural Networks and Probabilistic Models S38: Predictive Modeling.

Slides:

Advertisements

Similar presentations

Formulating objectives, general and specific

Advertisements

Dual interviews: Moving Beyond Didactics to Train Primary Care Providers in the Biopsychosocial Model James Anderson, PhD Fellow in Primary Care Psychology.

Learning from Imbalanced, Only Positive and Unlabeled Data Yetian Chen

Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.

Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.

A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.

 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.

S14: Interpretable Probabilistic Latent Variable Models for Automatic Annotation of Clinical Text Alexander Kotov 1, Mehedi Hasan 1, April.

Can Pretty People Have Their Cake and Eat it Too? Positive and Negative Effects of Physical Attractiveness. Megan M. Schad, David E. Szwedo, Joanna M.

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation EMNLP’14 paper by Kyunghyun Cho, et al.

Division of HIV/AIDS Managing Questionnaire Development for a National HIV Surveillance Survey, Medical Monitoring Project Jennifer L Fagan, Health Scientist/Interview.

Caroline Clements Project lead, Professor Nav Kapur

Jonatas Wehrmann, Willian Becker, Henry E. L. Cagnini, and Rodrigo C

Job Analysis Chapter 4 Md. Al-Amin.

Scalable EEG interpretation using Deep Learning and Schema Descriptors

CS 388: Natural Language Processing: LSTM Recurrent Neural Networks

CS 4501: Introduction to Computer Vision Computer Vision + Natural Language Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy / Justin Johnson.

Online Multiscale Dynamic Topic Models

Stephen Joseph Galli, MD Journal of Allergy and Clinical Immunology

Deep Learning Amin Sobhani.

Maintaining Recruitment and Informed Consent in the Later Stages of a Trial: the By-Band-Sleeve Study Paul Whybrow, Sangeetha Paramasivan, Jane Blazeby,

Introduction to Evaluation

School of Computer Science & Engineering

Online Conditional Outlier Detection in Nonstationary Time Series

Recurrent Neural Networks for Natural Language Processing

Session 1 – Study Objectives

Adversarial Learning for Neural Dialogue Generation

David L. Bell, MD, MPH Assistant Clinical Professor of Pediatrics

Speaker Institution Twitter: #AMIA2017

Contraception Cases Michelle M. Forcier, MD

Cultural Competence and Consumer Involvement: Practice and Theory

S117: Acute Setting Predictive Analytics Sharon E. Davis, MS

Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD

Deep Learning: Model Summary

April Idalski Carcone, Ph.D., M.S.W.

Machine learning in Action: Unpacking the Biographical Questionnaire

Design and Development of mHealth Applications S81 Scott McGrath, MS

Intro to NLP and Deep Learning

Different Units Ramakrishna Vedantam.

Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning Shizhu He, Cao liu, Kang Liu and Jun Zhao.

Ying He Wuhan University of Technology Twitter: #AMIA2017

Ying He Wuhan University of Technology

Efficient Estimation of Word Representation in Vector Space

Fenglong Ma1, Jing Gao1, Qiuling Suo1

A critical review of RNN for sequence learning Zachary C

Real-time Protection for Open Beacon Network

Advanced Artificial Intelligence

Paraphrase Generation Using Deep Learning

Image Captions With Deep Learning Yulia Kogan & Ron Shiff

Joseph D. Romano, MPhil Columbia University Twitter: #AMIA2017 #S23

Collen Award Honoree Session S77

The Big Health Data–Intelligent Machine Paradox

Other Classification Models: Recurrent Neural Network (RNN)

Lecture 16: Recurrent Neural Networks (RNNs)

Ying Dai Faculty of software and information science,

Resource Recommendation for AAN

View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.

ADVANCED ANOMALY DETECTION IN CANARY TESTING

Report by: 陆纪圆.

RegionAl: an Optimized Regional Classifier to Predict Mortality in

Presented by: Anurag Paul

Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.

S04: Machine Learning in Clinical Predictive Modeling I

Cengizhan Can Phoebe de Nooijer

DNN-BASED SPEAKER-ADAPTIVE POSTFILTERING WITH LIMITED ADAPTATION DATA FOR STATISTICAL SPEECH SYNTHESIS SYSTEMS Mirac Goksu Ozturk1, Okan Ulusoy1, Cenk.

GhostLink: Latent Network Inference for Influence-aware Recommendation

LHC beam mode classification

Neural Machine Translation by Jointly Learning to Align and Translate

Presentation transcript:

Predicting the Outcome of Patient-Provider Communication Sequences using Recurrent Neural Networks and Probabilistic Models S38: Predictive Modeling for Clinical Outcomes and Beyond Mehedi Hasan, Alexander Kotov, April Idalski Carcone, Ming Dong, Sylvie Naar Wayne State University, Detroit, MI

Disclosure I and my co-authors as well as my and their spouses/partners have no relevant relationships with commercial interests to disclose. AMIA 2018 Informatics Summit | amia.org

Learning Objectives After participating in this session the learner should be better able to: Formulate the problem of predicting the likelihood of eliciting a certain type of behavioral response during motivational interview as a sequence classification problem Understand how probabilistic and deep learning based methods can be applied to address this problem Learn how these methods can be applied to monitor the progression of motivational interviews and predicting the likelihood of eliciting “change talk” AMIA 2018 Informatics Summit | amia.org

Motivation The problem of analyzing temporally ordered sequences of observations generated by molecular, physiological and psychological processes to make predictions regarding the outcome of these processes arises in many domains of clinical informatics We focus on predicting the outcome of patient-provider communication exchanges in the context of a clinical dialog Automated methods to estimate the likelihood of eliciting a particular behavioral response from a patient based on a sequence of coded patient- provider communication exchanges Such methods can be used to help providers monitor progression of a clinical dialog in real-time AMIA 2018 Informatics Summit | amia.org

Study Context We focus on transcripts of Motivational Interviews (MI) with obese adolescents conducted by the counselors at the Department of Family Medicine and Public Health Sciences at Wayne State University In our previous work, we proposed and evaluated machine learning methods for automated annotation of MI transcripts with behavior codes: Alexander Kotov, Mehedi Hasan, April Carcone et al. "Interpretable Probabilistic Latent Variable Models for Automatic Annotation of Clinical Text", In Proceedings of the 2015 Annual Symposium of the American Medical Informatics Association (AMIA'15), pages 785-794 Mehedi Hasan, Alexander Kotov, April Idalski Carcone et al., "A Study of the Effectiveness of Machine Learning Methods for Classification of Clinical Interview Fragments into a Large Number of Categories", In the Journal of Biomedical Informatics (JBI), Volume 62, August 2016, pages 21-31 In this work, we focus on machine learning methods to predict the outcome of MI interviews (eliciting patient’s motivational statements, a.k.a. “change talk”) AMIA 2018 Informatics Summit | amia.org

Study Data: A Fragment of Motivational Interview Transcript Code Behavior Speaker Utterance SS Structure Session Counselor Okay. Can I meet with Xxxx alone for a few minutes? OQO Open-ended question, other So, Xxxx, how you doing? HUPO High uptake, other Adolescent Fine OQTBN Open-ended question, target behavior neutral That’s good. So, tell me how do you feel about your weight? CHT+ Change talk positive It’s not the best. CQECHT+ Closed question, elicit change talk positive It’s not the best? Yeah CQTBN Closed question, target behavior normal Okay, so have you tried to lose weight before? HUPW High uptake, weight Yes AMIA 2018 Informatics Summit | amia.org

Study Data: Motivational Interviews 129 motivational interview transcripts that include 50,239 segmented and annotated utterances 5,143 observed sequences 4,225 or 82.15% were positive Only 918 or 17.85% were negative Dealing with imbalanced data: Oversampling: new synthetic examples are generated for minority classes at the borderline between the majority and minority classes Undersampling: the number of samples in majority class was reduced by replacing the clusters of samples identified by the k-means clustering algorithm with the cluster centroids AMIA 2018 Informatics Summit | amia.org

Methods: Probabilistic Models Markov Chain (MC) Hidden Markov Model (HMM) SS HUPO OQO CHT+ 0.7 0.6 0.1 0.9 0.4 SS SO GINFO+ HUPO LUP+ HUPW OQO CQO CHT+ CML+ 0.7 0.6 0.1 0.9 0.4 0.3 0.3 AMIA 2018 Informatics Summit | amia.org

Methods: Probabilistic Models Transition probability from behavior code 𝑐 𝑖 to 𝑐 𝑗 in model M: Probability that a sequence of behavior codes S was generated by model M: Probability of successful outcome corresponds to the odds ratio of generating a sequence S from the models of successful versus unsuccessful interviews: AMIA 2018 Informatics Summit | amia.org

Methods: Recurrent Neural Networks Able to capture long-term dependencies between observations in a sequence Allow information to be passed between temporally separated observations in a sequence Have a memory, which can be reset RNNs have demonstrated remarkable results for NLP tasks, such as machine translation Employed RNN methods: Long Short Term Memory (LSTM) Gated Recurrent Unit (GRU) AMIA 2018 Informatics Summit | amia.org

Methods: Recurrent Neural Networks Loss AMIA 2018 Informatics Summit | amia.org

Methods: Behavior Code Embeddings Embeddings (i.e. real-valued low- dimensional vector representations) of behavior codes are produced as a byproduct of training RNNs Embeddings of similar codes are close to each other in low-dimensional space. The behaviors that tend to elicit CHT+/CML+ group together, whereas the behaviors that tend to elicit CHT-/CML- also group together and are located in the opposite regions of semantic space. 2D embedding diagram produced by t-SNE AMIA 2018 Informatics Summit | amia.org

Results: Model Performance Method Under-sampling Over-sampling Precision Recall F1-Score Markov Chain 1st Order 0.7060 0.7044 0.7038 0.7932 0.7799 0.7775 Markov Chain 2nd Order 0.6395 0.6385 0.6379 0.7111 0.7029 0.7000 Hidden Markov Model 0.6244 0.6143 0.6067 0.7567 0.7520 LSTM 0.8672 0.8626 0.8622 0.8411 0.8372 0.8368 LSTM-TR 0.8733 0.8681 0.8677 0.8424 0.8385 0.8381 GRU 0.8674 0.8648 0.8646 0.8379 0.8342 0.8337 GRU-TR 0.8705 0.8676 0.8673 0.8412 0.8377 0.8373 AMIA 2018 Informatics Summit | amia.org

Conclusion The task of predicting the outcome of patient-provider communication exchanges can be framed as a sequence classification problem RNNs significantly outperform probabilistic models for this problem LSTM with target replication as a training strategy is the most accurate predictor of the outcome of communication exchanges in clinical interviews Deep learning methods can be successfully applied for real-time monitoring of the progression of motivational interviews and establishing causal relationships between different provider communication strategies and patient behaviors AMIA 2018 Informatics Summit | amia.org

Future Research Directions Apply the proposed methods to other types of behavioral interventions or clinical interviews Automated methods to identify the most effective provider communication strategies to elicit a specific type of behavioral response from a given demographic group of patients (e.g. African-American adolescents) AMIA 2018 Informatics Summit | amia.org

Acknowledgments Advisor: Other collaborators: Funding: Dr. Alexander Kotov Textual Data Analytics Lab Wayne State University Other collaborators: Dr. April Carcone Dr. Ming Dong Dr. Sylvie Naar Funding: National Institute of Health (NIDDK) R21DK108071 Department of Family Medicine and Public Health members: Student assistants Research staff AMIA 2018 Informatics Summit | amia.org

Thank you! Questions?