Architecture-Specific Considerations and Methods for Data Quality Assessment in Collaborative Clinical Data Research Networks Chunhua Weng, PhD Associate.

Slides:



Advertisements
Similar presentations
Nursing Diagnosis: Definition
Advertisements

Quality Data for a Healthy Nation by Mary H. Stanfill, RHIA, CCS, CCS-P.
Alison Wallis, Clinical Advisor Putting Scottish Community Nurses on the Map.
Amy Sheide Clinical Informaticist 3M Health Information Systems USA Achieving Data Standardization in Health Information Exchange and Quality Measurement.
Inthira Kanchanaphibool Faculty of Pharmacy, Silpakorn University, Thailand 1.
University of Pittsburgh Department of Biomedical Informatics Healthcare institutions have established local clinical data research repositories to enable.
Supporting Medical Decision Making with Electronic Medical Records James J. Cimino Departments of Medicine and Medical Informatics Columbia University.
Documentation for Acute Care
Harnessing World Wide Web Technology and Standardized Terminology to Improve Decision Making for Patients and Providers James J. Cimino Departments of.
Tying it All Together Using EMRs to Support Quality Improvement 600 East Superior Street, Suite 404 I Duluth, MN I Ph or
Constructing a Data Management System National Center for Immunization & Respiratory Diseases Influenza Division Regional Training Workshop on Influenza.
Supported by the Patient-Centered Outcomes Research Institute (PCORI) Contract CDRN PopMedNet in the pSCANNER Network PMN User Meeting.
Medical Informatics "Medical informatics is the application of computer technology to all fields of medicine - medical care, medical teaching, and medical.
California Outcomes Measurement System – Treatment
Electronic Health Records Dimitar Hristovski, Ph.D. Institute of Biomedical Informatics.
National Efforts for Clinical Decision Support (CDS) Erik Pupo Deloitte Consulting.
Modular Chronic Disease Registries for QI and Comparative Effectiveness Research Keith Marsolo, PhD Assistant Professor Division of Biomedical Informatics.
Developing National Health Information Infrastructure (NHII) in the U.S. William A. Yasnoff, MD, PhD, FACMI Senior Advisor National Health Information.
Rotavirus Vaccine: use in Wisconsin, effects on primary care visits, hospitalizations, and laboratory detections Jonathan L. Temte, MD/PhD Associate Professor.
Unit 3.02 Understanding Health Informatics.  Health Informatics professionals treat technology as a tool that helps patients and healthcare professionals.
Integrating a Federated Healthcare Data Query Platform With Electronic IRB Information Systems Shan He IPHIE 2010.
Population Management What is that and why do I need to know? Elisha Brownfield, MD.
Application of the OHDSI CDM on Korean Healthcare System October, Department of Biomedical Informatics Ajou University School of Medicine Rae Woong.
1 DAF Phase 3 Objective  Enable researchers to access data from multiple organizations and data sources within a Learning Health System (LHS) infrastructure.
Amar K. Das, MD, PhD Associate Professor of Biomedical Data Science, Psychiatry and Health Policy & Clinical Practice Geisel School of Medicine at Dartmouth.
Learning Outcomes Discuss current trends and issues in health care and nursing. Describe the essential elements of quality and safety in nursing and their.
A Rapid-Learning Health System Using in silico research Lynn Etheredge Wolfram Data Summit - September 9, 2010.
PFF Teal = MAIN COLORS PFF Green = Light Green = Red = HIGHLIGHT COLORS Light Grey = Dark Grey =
Informatics Tools and Services Biomedical Informatics Core Tim Aro.
N VISUAL ANALYTICS FOR HEALTHCARE: BIG DATA, BIG DECISIONS David Gotz Healthcare Analytics Research Group IBM T.J. Watson Research Center.
Clinical Trials for Comparative Effectiveness Research Mark Hlatky MD Mark Hlatky MD Stanford University January 10, 2012.
How do data and creativity relate? 2007 “gets the right care to people when they need it and then captures the results for improvement”
© 2016 Chapter 6 Data Management Health Information Management Technology: An Applied Approach.
Tim Friede Department of Medical Statistics
Deploying ONC terminology standards in SNOW SHRINE i2b2 data warehouse
Deploying ONC terminology standards in SNOW SHRINE i2b2 data warehouse
3.02 Understand Health Informatics
An Open Collaborative Approach for Rapid Evidence Generation
REDCap General Overview
Building chronic disease registries from EMR and administrative data
Using Technology to Support Evidence Based Practice
OHDSI : Design and implementation of a comparative cohort study in observational health care data 아주대학교 의료정보학과 유승찬.
Electronic Health Records (EHR)
Models for Data Quality and Data Quality Assessment
Nutrition in HL7 Standards
pSCANNER’s Value: Beckstrom’s Law at Work
Component 11 Configuring EHRs
Impact of State Reporting Laws on Central Line– Associated Bloodstream Infection Rates in U.S. Adult Intensive Care Units Hangsheng Liu, Carolyn T. A.
Nutrition in HL7 Standards
3.02 Understand Health Informatics
3.02 Understand Health Informatics
Evan Sholle, MS Weill Cornell Medicine Twitter: #AMIA2017
Efficient Remediation of Terms Inactivated by Dictionary Updates
Shyam Visweswaran, MD, PhD
Data Quality of the Arkansas Clinical Data Repository
Managing Clinical Information: EHR Terms and Architecture
Opening slide 3:00 PM, EST.
HIMSS Advocacy Day Washington, DC April 1, 2004
Outline I have trouble working with ODBs
3.02 Understand Health Informatics
Systems Thinking for Everyday Work (STEW) Worksheet
Metadata Construction in Collaborative Research Networks
Information Retrieval for Evidence-based Practice
April 18th 2018 Moderator: Matthew Rioth
3.02 Understand Health Informatics
3.02 Understand Health Informatics
Interoperability with SNOW SHRINE
From Innovation to Commercialization Access to Data
Registered Nurse’s Use of HIT, 2006: Findings from a National Survey
REACHnet: Research Action for Health Network
Presentation transcript:

Architecture-Specific Considerations and Methods for Data Quality Assessment in Collaborative Clinical Data Research Networks Chunhua Weng, PhD Associate Professor of Biomedical Informatics Columbia University, New York City May 27, 2015

No competing interests to disclose Disclosures No competing interests to disclose

EHR data are subject to quality problems “With the advent of the information era in medicine, we are pouring out a torrent of medical record misinformation. Medical records, which have long been faulty, contain more distorted, deleted, and misleading information than ever before.” Burnum (1989) The misinformation era: the fall of the medical record.

What data quality problems should we try to prevent when creating clinical data research networks for CER?

Three CDRN Architecture Models Centralized Query of Everything Federated Query of Patient Counts Federated Query of Research Results

CDRN Model 1 Central CDW CDM-based De-identified data CDM-based index index index CDM-based De-identified data CDM-based De-identified data CDM-based De-identified data ETL ETL ETL CDW-1 CDW-2 CDW-n

What are data quality concerns for model 1? When and Where Data Quality Concerns At local level before ETL Sampling bias1, correctness, currency2, completeness3, concordance, plausibility, At local level during ETL Information loss, transformation/coding error At the central level during indexing Inconsistency or redundancy across sites At the central level after indexing currency, data provenance, research suitability 1. Rusanov A*, Weiskopf NG*, Wang S, Weng C, Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research, BMC Medical Informatics and Decision Making, 2014 Jun 11;14(1):51, 2. Weiskopf NG, Weng C, Methods and Dimensions of EHR Data Quality Assessment: Enabling Reuse for Clinical Research, J Am Med Inform Assoc. 2013 Jan 1;20(1):144-51 3. Weiskopf NG, Hripcsak G, Sushmita S, Weng C, Defining and measuring completeness for electronic health records for secondary use. J Biomed Inform, 2013 Oct;46(5):830-6. 

Example Data Quality Issues at Local Level Bias: Biases in lab ordering for specific population subgroups Biases towards specific population subgroups: do data represent the overall population? Bias in data measurement Completeness: Missing required data elements or variables for CER of particular diseases Completeness: Missing data due to data fragmentation Correctness: Accuracy of ICD-9 diagnosis Correctness: Incorrect coding/use of CDM or terminology Currency: Outdated gender information for transgender patients Plausibility: Discharge date is 25 days earlier than admission date

Example Data Quality Issues at Central Level Concordance: The same patient has different values from data submitted by different sources Redundancy: The same patient is created multiple times in the central database and treated as multiple patients Currency: Lack of timely sync between local and central data Data provenance: Unable to trace back to data sources: e.g., did you put in discharge diagnosis or admission diagnosis? Not granular: Lack of granularity of coding

Defining and Measuring Data Completeness

Defining and Measuring Data Completeness

CDRN Model 2: SHRINE Source of Information: https://open.med.harvard.edu/project/shrine/

CDRN Model 2: i2b2-SHRINE CDM-based data CDM-based data CDM-based data Shared Query Aggregated Counts CDM-based data CDM-based data CDM-based data CDM-based data ETL ETL ETL ETL CDW-1 CDW-2 CDW-3 CDW-n

What are data quality concerns for model 2? When and Where Data Quality Concerns At local level before ETL Sampling bias1, correctness, completeness3, concordance, plausibility, currency2 At local level during ETL Information loss, transformation/coding error At the central level during indexing Inconsistency or redundancy across sites At the central level currency, data provenance, research suitability 1. Rusanov A*, Weiskopf NG*, Wang S, Weng C, Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research, BMC Medical Informatics and Decision Making, 2014 Jun 11;14(1):51, 2. Weiskopf NG, Weng C, Methods and Dimensions of EHR Data Quality Assessment: Enabling Reuse for Clinical Research, J Am Med Inform Assoc. 2013 Jan 1;20(1):144-51 3. Weiskopf NG, Hripcsak G, Sushmita S, Weng C, Defining and measuring completeness for electronic health records for secondary use. J Biomed Inform, 2013 Oct;46(5):830-6. 

CDRN Model 3: OHDSI

CDRN Model 3: OHDSI CDM-based data CDM-based data CDM-based data Shared Protocol Aggregated Evidence CDM-based data CDM-based data CDM-based data CDM-based data ETL ETL ETL ETL CDW-1 CDW-2 CDW-3 CDW-n

What are data quality concerns for model 3? When and Where Data Quality Concerns At local level before ETL Sampling bias1, correctness, completeness3, concordance, plausibility, currency2 At local level during ETL Information loss, transformation/coding error At the central level during indexing Inconsistency or redundancy across sites At the central level currency, data provenance, research suitability, metadata transparency and completeness 1. Rusanov A*, Weiskopf NG*, Wang S, Weng C, Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research, BMC Medical Informatics and Decision Making, 2014 Jun 11;14(1):51, 2. Weiskopf NG, Weng C, Methods and Dimensions of EHR Data Quality Assessment: Enabling Reuse for Clinical Research, J Am Med Inform Assoc. 2013 Jan 1;20(1):144-51 3. Weiskopf NG, Hripcsak G, Sushmita S, Weng C, Defining and measuring completeness for electronic health records for secondary use. J Biomed Inform, 2013 Oct;46(5):830-6. 

Tool-Assisted Data Quality Assessment

Take home Different CDRN architectures entail different data quality assessment requirements Federated approach involving autonomous sites may minimize data query checking complexities Tools such as Achilles from OHDSI can be used for data quality assessment We need a rapid learning system for DQA