Predicting the Outcome of Patient-Provider Communication Sequences using Recurrent Neural Networks and Probabilistic Models S38: Predictive Modeling.

Slides:



Advertisements
Similar presentations
Formulating objectives, general and specific
Advertisements

Dual interviews: Moving Beyond Didactics to Train Primary Care Providers in the Biopsychosocial Model James Anderson, PhD Fellow in Primary Care Psychology.
Learning from Imbalanced, Only Positive and Unlabeled Data Yetian Chen
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.
A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
S14: Interpretable Probabilistic Latent Variable Models for Automatic Annotation of Clinical Text Alexander Kotov 1, Mehedi Hasan 1, April.
Can Pretty People Have Their Cake and Eat it Too? Positive and Negative Effects of Physical Attractiveness. Megan M. Schad, David E. Szwedo, Joanna M.
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation EMNLP’14 paper by Kyunghyun Cho, et al.
Division of HIV/AIDS Managing Questionnaire Development for a National HIV Surveillance Survey, Medical Monitoring Project Jennifer L Fagan, Health Scientist/Interview.
Caroline Clements Project lead, Professor Nav Kapur
Jonatas Wehrmann, Willian Becker, Henry E. L. Cagnini, and Rodrigo C
Job Analysis Chapter 4 Md. Al-Amin.
Scalable EEG interpretation using Deep Learning and Schema Descriptors
CS 388: Natural Language Processing: LSTM Recurrent Neural Networks
CS 4501: Introduction to Computer Vision Computer Vision + Natural Language Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy / Justin Johnson.
Online Multiscale Dynamic Topic Models
Stephen Joseph Galli, MD  Journal of Allergy and Clinical Immunology 
Deep Learning Amin Sobhani.
Maintaining Recruitment and Informed Consent in the Later Stages of a Trial: the By-Band-Sleeve Study Paul Whybrow, Sangeetha Paramasivan, Jane Blazeby,
Introduction to Evaluation
School of Computer Science & Engineering
Online Conditional Outlier Detection in Nonstationary Time Series
Recurrent Neural Networks for Natural Language Processing
Session 1 – Study Objectives
Adversarial Learning for Neural Dialogue Generation
David L. Bell, MD, MPH Assistant Clinical Professor of Pediatrics
Speaker Institution Twitter: #AMIA2017
Contraception Cases Michelle M. Forcier, MD
Cultural Competence and Consumer Involvement: Practice and Theory
S117: Acute Setting Predictive Analytics Sharon E. Davis, MS
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Deep Learning: Model Summary
April Idalski Carcone, Ph.D., M.S.W.
Machine learning in Action: Unpacking the Biographical Questionnaire
Design and Development of mHealth Applications S81 Scott McGrath, MS
Intro to NLP and Deep Learning
Different Units Ramakrishna Vedantam.
Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning Shizhu He, Cao liu, Kang Liu and Jun Zhao.
Ying He Wuhan University of Technology Twitter: #AMIA2017
Ying He Wuhan University of Technology
Efficient Estimation of Word Representation in Vector Space
Fenglong Ma1, Jing Gao1, Qiuling Suo1
A critical review of RNN for sequence learning Zachary C
Real-time Protection for Open Beacon Network
Advanced Artificial Intelligence
Paraphrase Generation Using Deep Learning
Image Captions With Deep Learning Yulia Kogan & Ron Shiff
Joseph D. Romano, MPhil Columbia University Twitter: #AMIA2017 #S23
Collen Award Honoree Session S77
The Big Health Data–Intelligent Machine Paradox
Other Classification Models: Recurrent Neural Network (RNN)
Lecture 16: Recurrent Neural Networks (RNNs)
Ying Dai Faculty of software and information science,
Resource Recommendation for AAN
View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.
ADVANCED ANOMALY DETECTION IN CANARY TESTING
Report by: 陆纪圆.
Attention.
RegionAl: an Optimized Regional Classifier to Predict Mortality in
Presented by: Anurag Paul
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
S04: Machine Learning in Clinical Predictive Modeling I
Cengizhan Can Phoebe de Nooijer
DNN-BASED SPEAKER-ADAPTIVE POSTFILTERING WITH LIMITED ADAPTATION DATA FOR STATISTICAL SPEECH SYNTHESIS SYSTEMS Mirac Goksu Ozturk1, Okan Ulusoy1, Cenk.
GhostLink: Latent Network Inference for Influence-aware Recommendation
LHC beam mode classification
Neural Machine Translation by Jointly Learning to Align and Translate
Presentation transcript:

Predicting the Outcome of Patient-Provider Communication Sequences using Recurrent Neural Networks and Probabilistic Models S38: Predictive Modeling for Clinical Outcomes and Beyond Mehedi Hasan, Alexander Kotov, April Idalski Carcone, Ming Dong, Sylvie Naar Wayne State University, Detroit, MI

Disclosure I and my co-authors as well as my and their spouses/partners have no relevant relationships with commercial interests to disclose. AMIA 2018 Informatics Summit | amia.org

Learning Objectives After participating in this session the learner should be better able to: Formulate the problem of predicting the likelihood of eliciting a certain type of behavioral response during motivational interview as a sequence classification problem Understand how probabilistic and deep learning based methods can be applied to address this problem Learn how these methods can be applied to monitor the progression of motivational interviews and predicting the likelihood of eliciting “change talk” AMIA 2018 Informatics Summit | amia.org

Motivation The problem of analyzing temporally ordered sequences of observations generated by molecular, physiological and psychological processes to make predictions regarding the outcome of these processes arises in many domains of clinical informatics We focus on predicting the outcome of patient-provider communication exchanges in the context of a clinical dialog Automated methods to estimate the likelihood of eliciting a particular behavioral response from a patient based on a sequence of coded patient- provider communication exchanges Such methods can be used to help providers monitor progression of a clinical dialog in real-time AMIA 2018 Informatics Summit | amia.org

Study Context We focus on transcripts of Motivational Interviews (MI) with obese adolescents conducted by the counselors at the Department of Family Medicine and Public Health Sciences at Wayne State University In our previous work, we proposed and evaluated machine learning methods for automated annotation of MI transcripts with behavior codes: Alexander Kotov, Mehedi Hasan, April Carcone et al. "Interpretable Probabilistic Latent Variable Models for Automatic Annotation of Clinical Text", In Proceedings of the 2015 Annual Symposium of the American Medical Informatics Association (AMIA'15), pages 785-794 Mehedi Hasan, Alexander Kotov, April Idalski Carcone et al., "A Study of the Effectiveness of Machine Learning Methods for Classification of Clinical Interview Fragments into a Large Number of Categories", In the Journal of Biomedical Informatics (JBI), Volume 62, August 2016, pages 21-31 In this work, we focus on machine learning methods to predict the outcome of MI interviews (eliciting patient’s motivational statements, a.k.a. “change talk”) AMIA 2018 Informatics Summit | amia.org

Study Data: A Fragment of Motivational Interview Transcript Code Behavior Speaker Utterance SS Structure Session Counselor Okay. Can I meet with Xxxx alone for a few minutes? OQO Open-ended question, other So, Xxxx, how you doing? HUPO High uptake, other Adolescent Fine OQTBN Open-ended question, target behavior neutral That’s good. So, tell me how do you feel about your weight? CHT+ Change talk positive It’s not the best. CQECHT+ Closed question, elicit change talk positive It’s not the best? Yeah CQTBN Closed question, target behavior normal Okay, so have you tried to lose weight before? HUPW High uptake, weight Yes AMIA 2018 Informatics Summit | amia.org

Study Data: Motivational Interviews 129 motivational interview transcripts that include 50,239 segmented and annotated utterances 5,143 observed sequences 4,225 or 82.15% were positive Only 918 or 17.85% were negative Dealing with imbalanced data: Oversampling: new synthetic examples are generated for minority classes at the borderline between the majority and minority classes Undersampling: the number of samples in majority class was reduced by replacing the clusters of samples identified by the k-means clustering algorithm with the cluster centroids AMIA 2018 Informatics Summit | amia.org

Methods: Probabilistic Models Markov Chain (MC) Hidden Markov Model (HMM) SS HUPO OQO CHT+ 0.7 0.6 0.1 0.9 0.4 SS SO GINFO+ HUPO LUP+ HUPW OQO CQO CHT+ CML+ 0.7 0.6 0.1 0.9 0.4 0.3 0.3 AMIA 2018 Informatics Summit | amia.org

Methods: Probabilistic Models Transition probability from behavior code 𝑐 𝑖 to 𝑐 𝑗 in model M: Probability that a sequence of behavior codes S was generated by model M: Probability of successful outcome corresponds to the odds ratio of generating a sequence S from the models of successful versus unsuccessful interviews: AMIA 2018 Informatics Summit | amia.org

Methods: Recurrent Neural Networks Able to capture long-term dependencies between observations in a sequence Allow information to be passed between temporally separated observations in a sequence Have a memory, which can be reset RNNs have demonstrated remarkable results for NLP tasks, such as machine translation Employed RNN methods: Long Short Term Memory (LSTM) Gated Recurrent Unit (GRU) AMIA 2018 Informatics Summit | amia.org

Methods: Recurrent Neural Networks Loss AMIA 2018 Informatics Summit | amia.org

Methods: Behavior Code Embeddings Embeddings (i.e. real-valued low- dimensional vector representations) of behavior codes are produced as a byproduct of training RNNs Embeddings of similar codes are close to each other in low-dimensional space. The behaviors that tend to elicit CHT+/CML+ group together, whereas the behaviors that tend to elicit CHT-/CML- also group together and are located in the opposite regions of semantic space. 2D embedding diagram produced by t-SNE AMIA 2018 Informatics Summit | amia.org

Results: Model Performance Method Under-sampling Over-sampling Precision Recall F1-Score Markov Chain 1st Order 0.7060 0.7044 0.7038 0.7932 0.7799 0.7775 Markov Chain 2nd Order 0.6395 0.6385 0.6379 0.7111 0.7029 0.7000 Hidden Markov Model 0.6244 0.6143 0.6067 0.7567 0.7520 LSTM 0.8672 0.8626 0.8622 0.8411 0.8372 0.8368 LSTM-TR 0.8733 0.8681 0.8677 0.8424 0.8385 0.8381 GRU 0.8674 0.8648 0.8646 0.8379 0.8342 0.8337 GRU-TR 0.8705 0.8676 0.8673 0.8412 0.8377 0.8373 AMIA 2018 Informatics Summit | amia.org

Conclusion The task of predicting the outcome of patient-provider communication exchanges can be framed as a sequence classification problem RNNs significantly outperform probabilistic models for this problem LSTM with target replication as a training strategy is the most accurate predictor of the outcome of communication exchanges in clinical interviews Deep learning methods can be successfully applied for real-time monitoring of the progression of motivational interviews and establishing causal relationships between different provider communication strategies and patient behaviors AMIA 2018 Informatics Summit | amia.org

Future Research Directions Apply the proposed methods to other types of behavioral interventions or clinical interviews Automated methods to identify the most effective provider communication strategies to elicit a specific type of behavioral response from a given demographic group of patients (e.g. African-American adolescents) AMIA 2018 Informatics Summit | amia.org

Acknowledgments Advisor: Other collaborators: Funding: Dr. Alexander Kotov Textual Data Analytics Lab Wayne State University Other collaborators: Dr. April Carcone Dr. Ming Dong Dr. Sylvie Naar Funding: National Institute of Health (NIDDK) R21DK108071 Department of Family Medicine and Public Health members: Student assistants Research staff AMIA 2018 Informatics Summit | amia.org

Thank you! Questions?