Joseph D. Romano, MPhil Columbia University Twitter: #AMIA2017 #S23

Slides:



Advertisements
Similar presentations
The Community Engagement Studio: Strengthening Research Capacity through Community Engagement Consuelo H. Wilkins, MD, MSCI Executive Director, Meharry.
Advertisements

Cost Assessment of Collaborative Healthcare
Harvard Catalyst: How Can It Help Your Research? Steven D. Freedman MD PhD Professor of Medicine, HMS Associate Dean for Clinical and Translational Research,
Dual interviews: Moving Beyond Didactics to Train Primary Care Providers in the Biopsychosocial Model James Anderson, PhD Fellow in Primary Care Psychology.
Evaluating A Patient-Centered Medical Home from the Patient’s Perspective Betty M. Kennedy, PhD Community Outreach Specialist Community Outreach Specialist.
Electronic Health Records and Clinical Decision Support Systems Impact on National Ambulatory Care Quality Max J. Romano, BA; Randall S. Stafford, MD,
Mary T. Kelleher, MS Faculty, Chicago Center for Family Health Tai J. Mendenhall, PhD Asst. Professor, Dept. of Family Social Science, University of Minnesota.
Program Evaluation Principles and Applications PAS 2010.
Amar K. Das, MD, PhD Associate Professor of Biomedical Data Science, Psychiatry and Health Policy & Clinical Practice Geisel School of Medicine at Dartmouth.
Consumer Control & the Independent Living Philosophy.
Identification of Sexual Health Variables SHWG members compiled 49 sexual health terms that were used to query the RDW using RedX Terms were categorized.
Amia.org Session Title Twitter: #iHealth16 Speaker Institution.
Bassem Makni SML 16 Click to add text 1 Deep Learning of RDF rules Semantic Machine Learning.
Welcome deep loria !.
Learning linguistic structure with simple and more complex recurrent neural networks Psychology February 2, 2017.
Scalable EEG interpretation using Deep Learning and Schema Descriptors
Deep Learning for Bacteria Event Identification
Dorota Kilańska RN, PhD European Nursing Research Foundation (ENRF)
Formal Conceptualization of Dental Diagnoses: Status Report
An Artificial Intelligence Approach to Precision Oncology
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Interoperable Social Determinants of Health: LOINCing the PCAM
Transforming Clerkships into an Integrated Longitudinal Learning Module: Model Madres: Maternal, Child and Family Health Adriana Padilla, MD Susan Hughes,
Speaker Institution Twitter: #AMIA2017
Department of Family and Community Medicine
Bret J. Gardner University of Nebraska Medical Center
Evan Sholle, MS Weill Cornell Medicine Twitter: #AMIA2017
Lincoln R. Sheets, MD, PhD University of Missouri Twitter: #AMIA2017
Shan He, PhD Intermountain Healthcare
Hong Kang, PhD (Presenter) Zhiguo Yu, PhD Yang Gong, MD, PhD
Intro to NLP and Deep Learning
Natural Language Processing of Knee MRI Reports
Chapter 16 Nursing Informatics: Improving Workflow and Meaningful Use
Ying He Wuhan University of Technology Twitter: #AMIA2017
Ying He Wuhan University of Technology
Predicting the Outcome of Patient-Provider Communication Sequences using Recurrent Neural Networks and Probabilistic Models S38: Predictive Modeling.
Fenglong Ma1, Jing Gao1, Qiuling Suo1
Using Tensorflow to Detect Objects in an Image
Working with Co-Design Whakakotahi Learning Session One 23 May 2017
Goodfellow: Chap 6 Deep Feedforward Networks
Real-time Protection for Open Beacon Network
NDARC Annual Symposium
Vessel Extraction in X-Ray Angiograms Using Deep Learning
Certified Professional Patient Navigator CPPN
Final Presentation: Neural Network Doc Summarization
Learning linguistic structure with simple and more complex recurrent neural networks Psychology February 8, 2018.
Connect with CSER for news & networking:
Predicting Pneumonia & MRSA in Hospital Patients
Speaker Institution Twitter: #AMIA2018
Clinical and Translational Science Awards Program
Word embeddings based mapping
Leading professional association for infection preventionists (IPs)
Word embeddings based mapping
Neural Networks Geoff Hulten.
Neural Speech Synthesis with Transformer Network
Resource Recommendation for AAN
A “bottom up” approach to curating electronic order sets
The Center for Nursing Research Ochsner Health System December 2015
Legislative Origins 2014 Protecting Access to Medicare Act
Speaker Name, Degree Position, Institution Date
April 18th 2018 Moderator: Matthew Rioth
TensorFlow: A System for Large-Scale Machine Learning
Margaret R. Punch, MD Michigan Medicine
The Research Question Continuity of care: does having the same primary care provider over time matter? S.T. Wong, A. Katz, Peterson, S., & Taylor, C. Does.
Automatic Handwriting Generation
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
S04: Machine Learning in Clinical Predictive Modeling I
Nursing informatics Lecture (11).
Moving from Health Care to Life Care
National Case Management Week
Presentation transcript:

Joseph D. Romano, MPhil Columbia University Twitter: #AMIA2017 #S23 Deep recurrent neural networks identify transgender patients Oral Presentations – Methods for Identification, Classification, and Association using EHR Data S23 Joseph D. Romano, MPhil Columbia University Twitter: #AMIA2017 #S23

Disclosure I have no relevant relationships with commercial interests to disclose. AMIA 2017 | amia.org

Learning Objectives After participating in this session the learner should be better able to: Conceptualize a recurrent neural network text classifier, and see how it can be applied to transgender patient classification. Understand the need for data-driven methods to improve healthcare for transgender patients. Understand that deep learning models do not address the ethical issues presented by tasks such as transgender status classification. AMIA 2017 | amia.org

The transgender health crisis Transgender individuals experience unique health disparities Lacking adequate subpopulation research due to historical stigmatization Health care professionals often untrained in LGBT health Specific physical and psychological comorbidities more common It is challenging to identify retrospective transgender cohorts ‘Transgender’ often not coded in health information systems Fear of stigmatization may lead to lack of disclosure Increased privacy concerns, particularly regarding EHR data Institute of Medicine (US). National Academies Press (US);2011. (PMID: 22013611) Kenrick approached us Based on some of his previous work AMIA 2017 | amia.org

Recurrent Neural Networks Accepts an ordered sequence as input In our case, a sequence of embedded words Returns a sequence as output For sequence classification, discard all but the last item in the output sequence http://colah.github.io/posts/2015-08-Understanding-LSTMs/ What is a neural network? AMIA 2017 | amia.org

Vectorizing words via embedding Mikolov, T et. al. NIPS. 2013;23:3111-3119. 2 primary benefits: - reduces dimensionality - models correlations between words AMIA 2017 | amia.org

Note classification pipeline AMIA 2017 | amia.org

Implementation LSTM network written in Keras (Tensorflow back-end) Embedding layer  LSTM layer  Fully connected layer Embedding dimensionality: 64 LSTM output dimensionality: 100 Activation functions: LSTM layer: Hard sigmoid Fully connected layer: Sigmoid 578,101 free parameters Trained on CentOS Linux server with 4x Nvidia Tesla P100 GPU Accelerators 14,336 total CUDA cores AMIA 2017 | amia.org

Implementation Targets Inputs AMIA 2017 | amia.org

Results: Cohort and note characteristics EHR Cohort Cases: 39 manually-identified transgender patients Controls: 400 randomly selected patients with clinical notes Free-text clinical notes Obtained all notes for included patients Tokenized; removed numbers, proper nouns, punctuation Left-pad/truncate notes to 1000 words 33/67% train-test split Each patients’ notes in either train or test set, never both Train word2vec embeddings on entire set of notes AMIA 2017 | amia.org

Results: Classifier performance Metric Score Accuracy 0.901 Precision 0.830 Recall 0.737 F1-Score 0.780 AUC ROC 0.940 AMIA 2017 | amia.org

Results: Word embeddings AMIA 2017 | amia.org

Results: Accuracy and training loss Training epoch: 1 2 3 4 5 Model trains (5 epochs) in 20-25 minutes AMIA 2017 | amia.org

Comparison to stroke classification Acute ischemic stroke Metric Score Accuracy 0.901 AUC ROC 0.940 AMIA 2017 | amia.org

Limitations and future improvements We need far more data! 37 patients so far–we must be overfitting How do we find more patients? grep approach is primitive Leverage emerging techniques to extract knowledge from the learned networks Neural networks are hard to introspect; no “beta coefficient” equivalent Eventually, incorporate into clinical decision support See a clinical note, evaluate, trigger alert if likely transgender AMIA 2017 | amia.org

Application to multiple institutions Does our model translate to other hospital systems? If not, how about the word embeddings? Major opportunity to improve training data size issues Different institutions/EHR systems implement gender differently NYP/Weill Cornell Medical Center: patient-reported gender with transgender options Stuck in IRB purgatory Use cutting-edge techniques to advance privacy guarantees Generative Adversarial Networks and/or Variational Autoencoders Differential privacy analysis AMIA 2017 | amia.org

Ethical considerations Essential to address the ethical concerns associated with automated identification of transgender patients Misuse could lead to patient discrimination Reidentification of training patients may be possible Gender is complicated, and imposing labels on patients may be counterproductive See S57: Oral Presentations, first presentation: “The Use of Informatics to Reduce Disparities in Transgender Health” (Kenrick Cato, PhD, RN) 8:30 AM-8:48 AM; Tuesday (Fairchild) Cato, K et. al. J Empir Res Hum Res Ethics. 2016;11(3):214-219. AMIA 2017 | amia.org

Acknowledgements Tatonetti Lab Kenrick Cato, PhD* Nicholas Tatonetti, PhD* Rami Vanguri, PhD* Kayla Quinnies, PhD Theresa Koleck, PhD Yun Hao Phyllis Thangaraj Alexandre Yahi Fernanda Polubriaginof, MD Nick Giangreco Jenna Kefeli Jing Ai Katie LaRow Kenrick Cato, PhD* *Coauthors AMIA 2017 | amia.org

AMIA is the professional home for more than 5,400 informatics professionals, representing frontline clinicians, researchers, public health experts and educators who bring meaning to data, manage information and generate new knowledge across the research and healthcare enterprise. AMIA 2017 | amia.org

Email me at: jdr2160@cumc.columbia.edu Thank you! Email me at: jdr2160@cumc.columbia.edu