Elizabeth Karlson, MD Associate Professor of Medicine

Slides:



Advertisements
Similar presentations
Hatem H Eleishi, MD Professor of Rheumatology, Cairo University Consultant Rheumatologist, Dr. Soliman Fakeeh Hospital Rheumatoid Arthritis Wednesday,
Advertisements

Sep 13 Fall 05 Electronic Medical Records (EMR) Computerized Patient Record (CPR) Electronic Health Record (EHR)
Treatment Management of Rheumatoid Arthritis. Findings and In Depth Analysis.
A quantitative approach to accurate classification of RA. Tom Huizinga.
Pragmatic use of data for research, audit and revalidation Will Dixon Health Informatics Unit 10 th anniversary Royal College of Physicians, 15 th Sept.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence May–June 2010.
DATA Role of data in QI and Scholarship Characteristics of “good” data Sources/categories of data Administrative databases – pros &cons New Informatics.
Preceptor: Louise A. Mawn, M.D. May 30, Medical Documentation Medical record serves many functions For health care providers it facilitates: Communication.
Reuse of Electronic Medical Records for Research Our architecture Two examples.
Data Collection in Private Practice and Implementation with Electronic Medical Records Martin J Bergman, MD Chief—Rheumatology Taylor Hospital Ridley Park,
1 Knowledge Management for Disease Coding (KMDC): Background & Introduction Timothy Hays, Ph.D. Project Manager, Knowledge Management for Disease Coding.
Potential Roles and Limitations of Biomarkers in Alzheimer’s Disease Richard Mayeux, MD, MSc Columbia University.
Model and Variable Selections for Personalized Medicine Lu Tian (Northwestern University) Hajime Uno (Kitasato University) Tianxi Cai, Els Goetghebeur,
What is the definition of remission in RA? American Rheumatism Association (ARA)¹ Five or more of the following criteria must be fulfilled for at least.
Clinical computing and the repository George Hripcsak Jim Cimino Pete Stetson.
Remission in Established RA Patients
Periodontal Disease Risk Developed by PreViser, Inc. Courtesy PreViser Corporation, all rights reserved.
Measuring Signs and Symptoms in Rheumatoid Arthritis David R. Karp, MD, PhD Chief, Rheumatic Diseases UT Southwestern Medical Center David R. Karp, MD,
Identifying RA patients from the electronic medical records at Partners HealthCare Robert Plenge, M.D., Ph.D. VA Hospital July 20, 2010 HARVARD MEDICAL.
1 Demographic and Clinical Characteristics of Rheumatoid Arthritis patients in some Arab States: Preliminary analysis Mohammed Hammoudeh,M.D Division of.
Psoriatic arthritis – definition and classification criteria Philip Helliwell Senior Lecturer in Rheumatology University of Leeds.
Rheumatoid Arthritis Anila Malik GPVTS. Aims To cover the following: What is RA? Diagnostic criteria and clinical features Rheumatoid Factor Investigations.
Clinical Registries Needs and Solutions Dr. Peter Greene, CMIO Diana Gumas, IT Director 1.
LARA MANGRAVITE SAGE BIONETWORKS ON BEHALF OF THE RA CHALLENGE ORGANIZING TEAM The DREAM Rheumatoid Arthritis Responder Challenge: Motivation, Data, Scoring.
Research Techniques Made Simple: Databases for Clinical Research Katrina Abuabara, MD MA David Margolis, MD PhD University of Pennsylvania.
Butte Lab Journal Club 16 Aug 2010 Alexander A. Morgan.
Shared Health Research Information Network Andrew McMurry, MS SHRINE Architect Harvard Medical School Center for BioMedical Informatics Children’s Hospital.
Consent2Share Linking Cohort Discovery to Consent David R Nelson MD Assistant Vice President for Research Professor of Medicine Director, Clinical and.
Crowdsourcing pharmacogenomic data analysis: PGRN-Sage RA Responder Challenge PGRN Spring Meeting April 30, 2013 HARVARD MEDICAL SCHOOL.
Square wheels: electronic medical records for discovery research in rheumatoid arthritis Robert M. Plenge, M.D., Ph.D. October 30, 2009 NCRR sponsored.
SHARPn High-Throughput Phenotyping (HTP) November 18, 2013.
Open Health Natural Language Processing Consortium (OHNLP)
Underdiagnosis of Pediatric Hypertension – An Example of the Potential of Electronic Medical Record Research for Clinical Pediatricians David C Kaelber,
1 Journal Club Alcohol, Other Drugs, and Health: Current Evidence March–April 2012.
Criteria to assess quality of observational studies evaluating the incidence, prevalence, and risk factors of chronic diseases Minnesota EPC Clinical Epidemiology.
Using EPR2020 for Clinical Research Diana Gumas, Director of Clinical and Clinical Research IT 1.
De-identification: A Critical Success Factor in Clinical and Population Research Steven Merahn MD Dee Lang, RHIT Prepared for 2007 APIII Pittsburgh, PA.
Rheumatology teaching session GP ST2 year 8/9/10.
1 Assessing Disease Activity Janice Booth Rheumatology Nurse Practitioner January 2012.
Studying Health Care: Some ICD-10 Tools Hude Quan, Nicole Fehr, Leslie Roos University of Calgary and Manitoba Centre for Health Policy.
Consultant Rheumatologist
Finding a Predictive Model for Post-Hospitalization Adverse Events Henry Carretta 1, PhD, MPH; Katrina McAfee 1,2, MS; Dennis Tsilimingras 1,3, MD, MPH.
Phenotype generation from EMR by tensor factorization SEDI Durham Cohort James Lu M.D. Ph.D. Department of Electrical and Computer Engineering Department.
Infectious arthritis Bacterial Viral Other Postinfectious (reactive) arthritis Rheumatic fever Reactive arthritis Enteric infection Other seronegative.
Signal identification and development I.Ralph Edwards.
1 The Relationship between Pay-for- Performance Incentives and Quality Improvement: A Survey of Massachusetts Physician Group Leaders Ateev Mehrotra, Steven.
Amar K. Das, MD, PhD Associate Professor of Biomedical Data Science, Psychiatry and Health Policy & Clinical Practice Geisel School of Medicine at Dartmouth.
Virtual ACE Update.
Outcome Measures in PsA Philip Mease MD Seattle, WA.
GRAPPA Guidelines for PsA: Considerations GRAPPA Guidelines Mission Statement: “To develop guidelines, based upon the best scientific evidence, for the.
Developing outcome prediction models for acute intracerebral hemorrhage patients: evaluation of a Support Vector Machine based method A. Jakab 1, L. Lánczi.
IMAGING NEURAL CORRELATES OF FATIGUE & PAIN IN PATIENTS WITH RA TREATED WITH TNF INHIBITORS FARHAN BARI SPR RHEUMATOLOGY WARN CONFERENCE 22 ND MAY 2014.
Methods We employ the UMLS Metathesaurus to annotate ICD-9 codes to MedDRA preferred terms (PTs) using the three-step process below. The mapping was applied.
Getting the detail right Standardisation and SNOMED CT © Nottingham University Hospitals NHS Trust.
Tumor necrosis factor antagonist use and associated risk reduction of cardiovascular events among patients with rheumatoid arthritis The Annals of the.
Extracting CHF information from clinical text using CLAMP Hua Xu, PhD pSCANNER
Showcasing work by Jonnageddala, Liaw, Ray, Kumar, Chang, and Dai on
Building chronic disease registries from EMR and administrative data
Anti-TNF therapy improves hand function in rheumatoid arthritis
CRF &SVM in Medication Extraction
Development and Validation of HealthImpactTM: An Incident Diabetes Prediction Model Based on Administrative Data Rozalina G. McCoy, M.D.1, Vijay S. Nori,
Promises and Perils of Machine Learning: A Real-world Evidence Fable
Roland C. Merchant, MD, MPH, ScD
Rhematoid Rthritis Respiratory disorders
Figure 2 Overlap of associated loci among five rheumatic diseases
Division of Viral Hepatitis, CDC
Clinical Research Informatics: From Bedside to Silicon and Back
Digital Biomarkers – Patient data mining & precision medicine Stefan Schulz, Medical University of Graz Donausymposium Vienna, March 14, 2018.
Using clinical trial data as real-world evidence
Women With Psoriasis Are Struggling More Than Men
Presentation transcript:

i2b2 Rheumatoid Arthritis DBP Defining RA in the electronic health record for future studies Elizabeth Karlson, MD Associate Professor of Medicine Harvard Medical School Brigham and Women’s Hospital

Background: Partners Resources i2b2: “Informatics for Integrating Biology and the Bedside” RPDR: “Research Patient Data Repository” Natural Language Processing (HiTEX) Gold standard dataset: Training set: 500 manual chart reviews Validation set: 400 manual chart reviews

ICD-9 codes for related phenotypes Coded data ICD-9 codes for RA ICD-9 codes for related phenotypes Lupus (SLE), psoriatic arthritis (PsA), juvenile inflammatory arthritis (JIA) Lab results for RA related antibodies Rheumatoid factor (RF), anti-CCP Medications physician entry, escripts

Different colors depending on yes/no/mentioned Click on tick mark to take to note or mention in record 4

NLP Concepts NLP queries Rheumatoid arthritis RA-related antibodies Anti-CCP/RF/seropositive Result coded as positive/negative RA Medications Coded as any mention Radiographs: RA erosions Coded as any erosion 5

Approach to develop RA cohort Classification algorithm Step 1: Develop gold standard training set Step 2: Identify variables important for predicting RA Step 3: Develop algorithm

Chart review results RA Mart, N=32,000 ICD9 = 714.xxx OR CCP test ordered Manual chart review for 500 patients 20% validation rate definite RA=100 possible/no RA= 400

Comparison of NLP to manual chart review Precision of NLP queries Methotrexate 100% Etanercept 100% CCP+ 98.7% Seropositive 96% Erosion 88%

Approach to develop RA cohort Classification algorithm Step 2: Define variables (Vivian Gainer, Sergey Goryachev, Qing Zeng-Treitler, Shawn Murphy) Codified data ICD9 billing codes Electronic medication prescription CCP, RF lab results Narrative data extracted using natural language processing (NLP), i.e. from physician notes, radiology reports Erosions RF positive, CCP positive, seropositive RA medications

Approach to develop RA cohort Classification algorithm Step 3: Develop algorithm (Tianxi Cai) Penalized logistic regression with adaptive LASSO Parsimonious predictors selected based on BIC We then applied the classification algorithm to the RA Mart 10

Narrative + Codified 3585 94 63 reference Codified only 3046 88 51 6 Model RA PPV (%) Sensitivity (%) Difference in PPV Algorithms Narrative + Codified 3585 94 63 reference Codified only 3046 88 51 6 NLP only 3341 89 56 5 Published administrative codified criteria ≥ 3 ICD9 RA 7960 80 38 ≥1 ICD9RA + med 7799 45 66 49

Top 5 predictive variables for RA Standardized regression coefficient Standard error NLP rheumatoid arthritis 1.11 0.48 NLP seropositive 0.74 0.26 ICD9 RA normalized 0.71 0.23 ICD9 RA 0.66 0.44 NLP erosions 0.46 0.29 An interesting aspect of our algorithm is that 3 out of the 5 most predictive variables for RA were derived using NLP. 12

Liao, et al., Arthritis Care & Research 2010 i2b2 RA cohort Characteristics I2b2 RA, n=3,585 CORRONA*, n=7,971 Age, mean (SD) 57.5 (17.5) 58.9 (13.4) Women (%) 79.9 74.5 Anti-CCP+ (%) 63 N/A RF+ (%) 74.4 72.1 Erosions (%) 59.2 52.8 MTX use (%) 59.5 TNFi use (%) 32.6 22.6 Although we were happy with the performance of our classification algorithm, we also wanted to know that these RA subjects clinically had RA. CORRONA- a more traditional cohort created by patient recruitment *Consortium of Rheumatology Researchers of North America Liao, et al., Arthritis Care & Research 2010 13

i2b2 Virtual RA Cohort Studies Case-control cohort ~4,000 RA cases ~13,000 matched non-RA controls Age, gender, race and health care utilization Samples collected from 1500 cases/1500 controls for genotyping Genetic risk score predicts RA with same magnitude as in GWAS (Kurreman, 2010) CAD outcomes in RA cases being validated in i2b2 Pharmacogenetics Research Network (PGRN) I’m going to start with what we have. 14

Selected codified data from RPDR i2b2 RA Project: Selected codified data from RPDR Performed NLP queries for RA features Developed algorithm based on: coded + NLP data Liao, 2010

PGRN Methods: Select codified data from RPDR (meds) Perform NLP queries for RA disease activity features Develop algorithm (s) based on: Meds + NLP data

17

PGRN Specific Aims Aim 1: Define RA disease activity level in the EMR Aim 2: Develop an algorithm to predict RA disease activity from EMR data Aim 3: Define temporal relations between RA medications and disease activity to define to define treatment response in RA

Background In RA, disease activity score (DAS28) is considered the gold standard tool to evaluate disease activity and response to treatment in clinical practice DAS28 has 2 components: Disease activity level Change in disease activity level

Disease activity level scored as low, moderate, high Disease activity change scored as low, moderate, high Van Gestel AM et al. Arthritis Rheum 1996; 39: 34-40

Research Methods Construct a virtual cohort of RA patients (N=5906) Review charts for disease activity (document level) Remission Low Moderate  Remission/Low vs. High/Moderate High Indeterminant Annotate charts for disease activity features (Knowtator) Disease_disorder Symptoms (reported pain, stiffness, swelling) Signs (objective tenderness, limited range of motion, synovitis) Anatomic site (relations with signs and symptoms) RA medication signature RA labs, level of inflammation (CRP, ESR) Patient functioning (activities of daily living)

NLP Methods Move from keyword matching in i2b2 to ontology mapping in PGRN Customize cTAKES for RA medications RA anatomic sites Find relations between entities Define new modules RA medication changes (start/stop) Reasons to stop medications Lab values Patient functioning status

NLP Analytic Approaches 1- Internal gold standard datasets N=200 BWH annotated notes N= 200 MGH annotated notes 2- Analyses Study whether MD summary (1-3 sentences) predicts disease activity SVM: construct vectors based on features and relations to predict disease activity Bag of concepts to predict disease activity  2- External gold standard datasets: DAS28 scores from standardized tool at MGH matched to clinical note DAS28 scores from BRASS matched to clinical note

Future work Define temporal relations between anti-TNF medication use (eg. new starts) and pre and post start disease activity to define response to therapy Construct disease activity timeline (patient level) Construct medication timeline (patient level)

Use NLP to define temporal sequence of medication start and adverse event

Questions?