Using EMR Data for Population Registries Diana Gumas, JHMCIS Senior Director for Research Systems, EPR and EPR2020/Amalga David Thiemann, Center for Clinical.

Slides:



Advertisements
Similar presentations
Tips to a Successful Monitoring Visit
Advertisements

1 EPR2020 Project December 16, 2009.
David Cloutier Director, Research Center Management and Development Budgeting for Industry Sponsored Clinical Trials.
Area 4 SHARP Face-to-Face Conference Phenotyping Team – Centerphase Project Assessing the Value of Phenotyping Algorithms June 30, 2011.
March - April 2003 Boston Children’s Hospital e Standardization and Automatic Extraction of Quality Measures in an Ambulatory EMR Denni McColm, CIO,
ICU Clinical Information Management System An Investigation for a Pediatric Intensive Care Unit Steven Sousa Ann Thompson.
BTRIS: The NIH Biomedical Translational Research Information System James J. Cimino Chief, Laboratory for Informatics Development NIH Clinical Center.
Data Warehouse success depends on metadata
BTRIS: The NIH Biomedical Translational Research Information System James J. Cimino Chief, Laboratory for Informatics Development NIH Clinical Center.
© 2013 The McGraw-Hill Companies, Inc. All rights reserved. Chapter 9 Tests, Procedures, and Codes.
1 © Prentice Hall, 2002 Chapter 11: Data Warehousing.
Integrated Practice Management Systems. Learning Objectives After reading this chapter the reader should be able to: Document the workflow in a medical.
INTRODUCTION TO ICD-9-CM PART TWO ICD-9-CM Official Guidelines (Sections II and III): Selection of Principal Diagnosis/Additional Diagnoses for Inpatient.
6.07 General Enhancements. Why are we upgrading the Meditech software? Patient Quality and Safety Meaningful Use Requirements Health Care Reform Act Reimbursement.
ETL By Dr. Gabriel.
Administrative Data Sources Nov 10, Decrease MPOG data contribution variation Current “required” data elements for MPOG sites – Intraoperative anesthesia.
Preparing for ICD-10 Transition Teleconference Hosted by Community Health Association of Mountain/ Plains States (CHAMPS) January 30, 2013, 12:00 PM –
1 Understanding and Using NAMCS and NHAMCS Data: A Hands-On Workshop Susan M. Schappert Donald K. Cherry.
Clinical Registries Needs and Solutions Dr. Peter Greene, CMIO Diana Gumas, IT Director 1.
Performance Dashboard
0 ICC Community CY 2013 Chart Audit September 9, 2014 Audit Timeframe: Jan 1, 2013 – Dec 31, 2013.
Consent2Share Linking Cohort Discovery to Consent David R Nelson MD Assistant Vice President for Research Professor of Medicine Director, Clinical and.
Performance Measures 101 Presenter: Peggy Ketterer, RN, BSN, CHCA Executive Director, EQRO Services Health Services Advisory Group June 18, :15 p.m.–4:45.
The NHCS Pretest: Incorporating DAWN Rong Cai, Charles Day DAWN, CBHSQ, SAMHSA August 8, 2012.
Why Use MONAHRQ for Health Care Reporting? May 2014 Note: This is one of seven slide sets outlining MONAHRQ and its value, available at
© Copyright IBM Corporation 2008 Health Analytics: An Overview HealthTech Net November 20, 2008 Richard Singerman, Ph.D.
Underdiagnosis of Pediatric Hypertension – An Example of the Potential of Electronic Medical Record Research for Clinical Pediatricians David C Kaelber,
Why Use MONAHRQ for Health Care Reporting? March 2015 Note: This is one of eight slide sets outlining MONAHRQ and its value, available at
Finance and Contract Pre-Award Process Rachel Sheppard Regulatory Director, OCRSS.
1 PAP Tools: RxAssist Plus EVOLUTION OF THE COMPUTERIZED PAP.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Using EPR2020 for Clinical Research Diana Gumas, Director of Clinical and Clinical Research IT 1.
Performance Measures 101 Presenter: Peggy Ketterer, RN, BSN, CHCA Executive Director, EQRO Services Health Services Advisory Group March 28, :00.
Decision Support and Date Warehouse Jingyi Lu. Outline Decision Support System OLAP vs. OLTP What is Date Warehouse? Dimensional Modeling Extract, Transform,
Fee Basis & NPPD Data Mark W. Smith, Ph.D. August 3, 2005 Health Economics Teleconference Seminar access code
Instructor: Mary “Stela” Gallegos, ABD, (RT), (R), (M) Seminar 4.
Unit 3.02 Understanding Health Informatics.  Health Informatics professionals treat technology as a tool that helps patients and healthcare professionals.
Chapter 7: Indexes, Registers, and Health Data Collection
Research Tools Brought to you by the Clinical and Translational Science Institute Presented by: Terri Shkuda Systems Analyst Research Informatics The Penn.
Health Informatics Health Informatics professionals treat technology as a tool that helps patients and healthcare professionals Understand health.
Enhancing interoperation: an i2b2 ETL schema for Epic EHRs
Research Approval Workflow EPIC Optimization
3.02 Understand Health Informatics
Clinical Terminology and One Touch Coding for EPIC or Other EHR
3.02 Understand Health Informatics
The Role and Responsibilities of the Clinical Research Coordinator
Data Management Planning for JHU SOM IRB Approval
Quality of Electronic Emergency Department Data: How Good Are They?
Table 1: Patient Demographics
Clinical Data Exchange – Report Card
Enhancing interoperation: an i2b2 ETL schema for Epic EHRs
Northwestern Medicine Enterprise Data Warehouse
3.02 Understand Health Informatics
3.02 Understand Health Informatics
Amanda L. Do, MPH1,2, Ruby Y. Wan, MS1,2, Robert W
Structure of Industry Budgets
Claire McKinley, PMP, CCRP
1422 Pre- Diabetes and Undiagnosed HTN Measures
Clinical Trials Budgeting
Lesson Four: Accessing Demographics & Summary Information
Northwestern Medicine Enterprise Data Warehouse
3.02 Understand Health Informatics
Managing Medical Records Lesson 1:
Research in Epic Update
How to Use i2b2 URL to i2b2: Questions?
3.02 Understand Health Informatics
3.02 Understand Health Informatics
David Gilmore & Richard Blevins Senior Consultants April 17th, 2012
IRB protocol no PI: Dr. David F. Chhieng
Auditing Techniques for Ensuring Quality Data in a Registry
Presentation transcript:

Using EMR Data for Population Registries Diana Gumas, JHMCIS Senior Director for Research Systems, EPR and EPR2020/Amalga David Thiemann, Center for Clinical Data Analysis 1

Potential Data Uses Sample Size Estimates (aggregate data without IRB approval) –Feasibility, grant applications, statistical planning Identifying patients for enrollment/recruitment –By diagnosis, pathology, stage, labs, meds Identifying/creating matched study controls Obtaining current demographics (name, address) for mail solicitation –From research list or by clinic, provider, clinical criteria Obtaining ongoing clinical + administrative data on a registry panel –Labs, visits, procedures, immunizations, CPT/ICD9 codes, resource use 2

Possible research data sources EPR (JHH & JHBMC) Sunrise Clinical Manager (JHH – inpatient) Meditech (Bayview) Casemix Datamart GE Centricity (JHCP) EPR2020 Departmental Systems (ED, OR, Anesthesia) Clinical Research Management System (CRMS) IDX (professional fees) Death Registry 3

Methods for Data Access Historical: Researcher Negotiates Access With Clinical System /DBA –Logistic nightmare, technical challenge Clinical Research Management System (CRMS) –Study cohort with real-time links to enterprise data Center for Clinical Data Analysis –Monthly/quarterly data extracts from designated systems 4

Clinical Research Management System (CRMS) 5 1,054 Users 1079 Active Studies 25,430 Participants Data Available in CRMS –eIRB –EPR (patient demographics) –Study participants / accruals –Electronic Case Report Forms - in next 2-3 months

Clinical Research Management System (CRMS) 6 Ways to extract data –Canned Reports (click for examples) –Ad-hoc querying using SQL –Possible with CCDA support - automated study-specific data extracts

EPR2020 Data for Researchers 7 4.2M Patients, 23.4M Visits 12.3M Documents, 6.8M Radiology Reports 25.6M Lab Results 1.5M Problems, 2.2M Medications, 140K Allergies Planned Bayview & JHCP data ICD9 diagnosis codes and CPT charges (IDX) Future Death Registry Blood Product Data for Transfusions Eclipsys SCM Order data HMED (ED), ORMIS, eADR/Medivision From EPR Today

My Participant’s Lab Data 8 Reliable. Driven by the CRMS Participant Registry. Exportable.

Registry Cohort Discovery using EPR2020 A JHM investigator wants to find and enroll diabetic patients aged years with hemoglobin A1C between 7 and 9% serum creatinine < 2 mg/dl 9

Center for Clinical Data Analysis (CCDA) Provides periodic (monthly/quarterly) bulk data extracts (delimited/flat files,.xls): Preliminary, anonymous data for feasibility, grant applications and statistical sample-size estimates IRB-approved case-finding--for study enrollment (mailings, phone solicitation), chart review, and cohort/case-control studies Research data extracts - monthly/quarterly integrated extracts from EPR, POE, ORMIS, lab/PDS, billing systems, vaccination/transfusion/culture data, etc. 10

How CCDA works cc: phone For IRB-approved research: –Provide full protocol + IRB approval –Meet to discuss query methods, format –Iterate, then schedule prod ( extracts, Jshare) –Cost: $100/hour For non-IRB projects (exploratory analyses, QI) –Same process, cost subsidized by ICTR/JHM –Do NOT implicitly morph QI into IRB 11

The Basics: Getting Clinical Data Into a Registry Database Real work, not ad hoc/bootstrap Need $$$ and FTE(s) Smart analyst(s) who know database technology and understand (or can learn) nuances of the sources and content domain Hands-on PI management/guidance Statistical liason early, before database schema and ETL methods are set in stone 12

The Extract-Transform-Load process: Getting Clinical Data into Research DB Raw clinical/administrative data is useless for research Build an intermediate (staging) database –Don’t do data management in SAS/Stata/Excel Data dictionary—derivation for each field Templated, tested, documented cleanup scripts/routines. Intermediate tables: Log each step/modification –For each batch, be able to re-create data transform from scratch –Version control, change control and documentation are vital –Build data versioning into the database 13

Transforming Data Raw data typically string (char/text) fields Unanalyzable characters (*, comments) still have meaning –Put non-numeric data in separate field. Avoid numerical recoding (999) ~3% of pts have multiple/non-preferred MRNs –Need 1-to-many link table Assays/reference ranges/coding changes –Avoid using raw codes (CPT/ICD) in research db –Map clinical codes to research terms Defer analytic assumptions. When recoding data, anticipate problems. Keep options open. 14

More Data Transform Challenges NEVER trust raw data. Learn business logic of source system. –CPTs morph annually, internal complexity/redundancy –Lab assays/reference/terms change –Parsing is inherently unreliable –Administrative names/groups change (clinic #s, departments). Duplicate-value problems (labs, orders) System-attribution source/datetime (POE, lab) Always run an aggregate (“group by” ) query to identify alternative names (eg lab name) and values (number, result) before transform. Otherwise you’ll miss something 15

Understanding Business Logic Trust but verify: Test coding accuracy –Providers may habitually use imprecise/inaccurate diagnosis codes (especially in profee data) –ICD9 procedure indications often a billing fiction –Trained coders may make systematic errors –Different content domains may have different standards (inpt vs outpt coders) –Don’t infer/assume dependencies unless enforced by source system. Run min/max queries, aggregates, outer joins –Confirm date ranges, data ranges, relative proportions by year Don’t assume that null rows actually are empty. Maybe the query missed something 16

JHM Clinical Data Landscape: Past, Present and Future Past : Babble of unintegrated systems EPR (antiquated technology, VSAM files, DB2) contains text, not queryable, analyzable data Present: EPR2020 (aka Amalga) –integrated data!! Has everything in EPR, plus JHCP, plus gradually adding data from clinical/departmental/administative systems (IDX CPTs, transfusion medicine, ORMIS, HMED, eADR, death registry, ad infinitum) Future: ? Epic, ? JHM Data Warehouse Epic: One system replacing all major JHM systems JHH timeline: 4+ years 17

JHM Data Sources: Casemix Datamart Gold standard for JHM (non-profee) administrative data, including payer/insurance data Combines data from Keane (hospital charges), ADT (admission/discharge/transfer), HDM (ICD9 diagnosis + procedure coding), HSCRC (regulatory submissions) Not a true data warehouse; meager reconciliation Best source for length of stay, resource use, ICD9 diagnoses Outpatient ICD9s limited Has JHH + BMC + HCGH data 18

JHM Data Sources: IDX (profee) Gold standard for inpatient +outpatient CPT (profee charge) data ICD9 diagnosis data problematic Limitation: No data from non-faculty providers (private physicians, etc.) Difficult to query. Has a data warehouse, limited access. Early target for EPR2020/Amalga integration. 19

JHH Data Sources: SCM/POE Sunrise Clinical Manager/Provider Order Entry Replicated transactional database, difficult to query For registry purposes POE has large attribution/process challenges: Stutter-step orders, multiple alerts, imputed times Great source for inpatient meds, labs, physiologic monitor data No codified ICD9/Snomed/RxNorm data No outpatient data 20

JHH Data Sources: SCC/AIM Sunrise Critical Care (aka Emtek, Eclipsys). JHH ICUs + stepdown units + oncology AIM analytic database contains selected but comprehensive batch extract Sunsets as ICUs switch to POE ClinDoc Challenging to query. Lots of denormalized fields 21

JHH + BMC Data Sources: PDS PDS=Pathology Data Systems Includes lab, transfusion medicine, anatomic pathology, cytopath, John Boitnott’s death registry Lab data also available via EPR2020/Amalga and POE 22

BMC Data Sources: Meditech Shrink-wrapped, comprehensive inpatient + outpatient clinical + financial system Difficult for ad hoc research queries. Exports data to Datamart and EPR2020 BMC-JHH patient linkage doable but difficult, needs caution 23

JHCP Data Sources: GE Centricity All clinical + administrative data for JHCP clinics Largely opaque to research query; JHCP sometimes collaborates directly, especially for its physician/investigators Early target for EPR2020/Amalga integration Linkage challenges to BMC and JHH mrns 24

JHH Departmental Data: ORMIS + eADR/Medivision ORMIS: Operating Room Management Information System Mostly transactional scheduling/tracking/administrative data, limited clinical data. Has diagnoses, procedures, case start/stop times eADR/Medivision (anesthesia) still evolving, limited research data access Design challenges similar to legacy SCC critical- care system. 25

JHH Departmental Data: HMED (Emergency Department) Mostly opaque to research Replicated data hosted by Datamart 26