Funding was provided by a contract from AcademyHealth. Additional support was provided by AHRQ 1R01HS019912-01 (Scalable PArtnering Network for CER: Across.

Slides:



Advertisements
Similar presentations
Dr Eva Batistatou. Outline of this presentation… What is epidemiology? The Fundamentals of Epidemiology course What is biostatistics? The Biostatistics.
Advertisements

Donald T. Simeon Caribbean Health Research Council
Comparator Selection in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Lesson 3 ODOT Analysis & Assessment. Analysis & Assessment Learning Outcomes As part of a small group, apply the two- part analysis by generating exposure-
Clinical research and the electronic medical record: Interdisciplinary research agendas Michael G. Kahn MD, PhD Biomedical Informatics Core Director Colorado.
Copyright ©2011 Freedman Healthcare, LLC All Payer Claims Datasets: Big Data is Coming to Public Health Officials, Providers and Patients Near You StrataRx.
Community Health Centers Implementing EHRs: Lessons Learned Oliver Droppers, M.P.H., Sherril Gelmon, Dr.P.H., Siobhan Maty, Ph.D., and Vickie Gates Portland.
Local Health Department Perspective Electronic Medical Record Software and Health Information Exchanges Kathleen Cook Information & Fiscal Manager, Lincoln-Lancaster.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
Insurance Continuity and Receipt of Diabetes Preventive Care in Oregon’s Community Health Centers.
Chapter 7. Getting Closer: Grading the Literature and Evaluating the Strength of the Evidence.
Enhancing Surveillance with the Colorado Child Health Survey Jodi Drisko, MSPH Jason Gannon Alyson Shupe, MSW, PhD Colorado Department of Public Health.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Supported by The Children’s Hospital Research Institute and the NIH/NCRR Colorado CTSI Grant Number UL1 RR Its contents are the authors’ sole responsibility.
Inter-institutional Data Sharing, Standards and Legal Arthur Davidson, MD, MSPH Agency for Healthcare Research and Quality, Washington, DC June 9, 2005.
Conceptual Model Building: Overview Felicia Hill-Briggs, PhD, ABPP Associate Professor Departments of Medicine and Health, Behavior, and Society, Welch.
A Robust Health Data Infrastructure P. Jon White, MD Director, Health IT Agency for Healthcare Research and Quality
Supported by the Patient-Centered Outcomes Research Institute (PCORI) Contract CDRN PopMedNet in the pSCANNER Network PMN User Meeting.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
The London Older People Service Development Program (LOPSDP) The ‘Medicines Management’ Project (January to July 2003) Lelly Oboh Project Co-ordinator.
Funding was provided by a contract from AcademyHealth. Additional support was provided by AHRQ 1R01HS (Scalable PArtnering Network for CER: Across.
STUDY PLANNING & DESIGN TO ENHANCE TRANSLATION OF HEALTH BEHAVIOR RESEARCH Lisa Klesges, Russell Glasgow, Paul Estabrooks, David Dzewaltowski, Sheana Bull.
WP.5 - DDI-SDMX Integration
Including a detailed description of the Colorado Growth Model 1.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Sina Keshavaarz M.D Public Health &Preventive Medicine Measuring level of performance & sustaining improvement.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Teaching the Science Base of MCH Donna Strobino, PhD.
ITEC224 Database Programming
1 Introduction to Grant Writing Beth Virnig, PhD Haitao Chu, MD, PhD University of Minnesota, School of Public Health December 11, 2013.
ISO 9000 & TOTAL QUALITY ISO 9000 refers to a group of quality assurance standards established by the International Organization for Standardization.This.
My Own Health Report: Case Study for Pragmatic Research Marcia Ory Texas A&M Health Science Center Presentation at: CPRRN Annual Grantee Meeting October.
Exposure Definition and Measurement in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
©Ian Sommerville 2000, Mejia-Alvarez 2009 Slide 1 Software Processes l Coherent sets of activities for specifying, designing, implementing and testing.
IST 210 Database Design Process IST 210 Todd S. Bacastow January 2005.
Study Designs Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /4/20151.
HIT Policy Committee Quality Measures Workgroup October 28, 2010 Fred D Rachman, MD.
INTERNATIONAL SOCIETY FOR TECHNOLOGY IN EDUCATION working together to improve education with technology Using Evidence for Educational Technology Success.
Copyright © 2010 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 15 Community As Client: Applying the Nursing Process.
Chapter 6 – Data Handling and EPR. Electronic Health Record Systems: Government Initiatives and Public/Private Partnerships EHR is systematic collection.
Systematic Review Module 7: Rating the Quality of Individual Studies Meera Viswanathan, PhD RTI-UNC EPC.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
HIT Policy Committee NHIN Workgroup Recommendations Phase 2 David Lansky, Chair Pacific Business Group on Health Danny Weitzner, Co-Chair Department of.
Method How are data being collected? Data collection is done manually from paper IRB records. Every member of the CTSI Study Registry team is thoroughly.
Amy Fine Center for the Study of Social Policy
ArcGIS Data Reviewer: An Introduction
Preparing Electronic Health Records for Multi-Site CER Studies Michael G. Kahn 1,3,4, Lisa Schilling 2 1 Department of Pediatrics, University of Colorado,
The HMO Research Network (HMORN) is a well established alliance of 18 research departments in the United States and Israel. Since 1994, the HMORN has conducted.
Click to edit Master title style Health Information Technology: Driving Improvements in Medicaid Don Imholz Executive Vice President and Chief Information.
Shaping a Health Statistics Vision for the 21 st Century 2002 NCHS Data Users Conference 16 July 2002 Daniel J. Friedman, PhD Massachusetts Department.
Insurance Continuity and Receipt of Diabetes Preventive Care in Oregon’s Community Health Centers.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Health Literacy Integrating Health Literacy, Cultural Competency, & Language Access Services Health Literacy Michael Wolf, PhD MPH.
Program Evaluation Principles and Applications PAS 2010.
A NEW REIMBURSEMENT STRUCTURE FOR AMERICA ADVANCED DISEASE CONCEPTS.
RTI International is a trade name of Research Triangle Institute Nancy Berkman, PhDMeera Viswanathan, PhD
Is for Epi Epidemiology basics for non-epidemiologists.
Research Methodology II Term review. Theoretical framework  What is meant by a theory? It is a set of interrelated constructs, definitions and propositions.
Evaluating Efforts to Support Collaborative Research: Lessons Learned from the AHRQ MCC Research Network Jessie Gerteis, MPH Abt Associates, Inc. 27 th.
Biomedical Informatics and Health. What is “Biomedical Informatics”?
Comparative Effectiveness Research (CER) and Patient- Centered Outcomes Research (PCOR) Presentation Developed for the Academy of Managed Care Pharmacy.
1 DATA Act Information Model Schema (DAIMS) Version 1.0 Briefing June 2016.
Stages of Research and Development
Presentation Developed for the Academy of Managed Care Pharmacy
Models for Data Quality and Data Quality Assessment
Chapter 15 Community As Client: Applying the Nursing Process
Presentation transcript:

Funding was provided by a contract from AcademyHealth. Additional support was provided by AHRQ 1R01HS (Scalable PArtnering Network for CER: Across Lifespan, Conditions, and Settings), AHRQ 1R01HS (Scalable Architecture for Federated Translational Inquiries Network), and NIH/NCRR Colorado CTSI Grant Number UL1 RR (Colorado Clinical and Translational Sciences Institute). A Pragmatic Model for Data Quality Assessment in Clinical Research Michael G. Kahn, M.D., Ph.D. Department of Pediatrics University of Colorado, Denver Colorado Clinical and Translational Sciences Institute Department of Clinical Informatics, The Children’s Hospital Electronic Data Methods Forum Methods Symposium 17-October-2011

Disclosures Presentation based on AcademyHealth supported paper: “A Pragmatic Framework for Single-Site and Multi-Site Data Quality Assessment in Electronic Health Record-Based Clinical Research” Michael G. Kahn *,1,2, Marsha A. Raebel 3,4, Jason M. Glanz 3,5, Karen Riedlinger 6, John F. Steiner 3 1. Department of Pediatrics, University of Colorado Anschutz Medical Center, Aurora Colorado 2. Colorado Clinical and Translational Sciences Institute, University of Colorado Anschutz Medical Center, Aurora Colorado 3. Institute for Health Research, Kaiser Permanente Colorado, Denver Colorado 4. School of Pharmacy, University of Colorado, Aurora, Colorado 5. Department of Epidemiology, Colorado School of Public Health, Aurora, Colorado 6. Northwest Kaiser Center for Health Research, Portland Oregon 2

What is the issue? Poor data quality can invalidate research findings –Cohort identification –Risk factors / exposures / confounders –Interventions –Outcomes Data quality in non-research settings even more problematic –Documentation practices –Workflow –Diligence to data quality Our focus: how to assess data quality systematically? 3

Why is a systematic data quality assessment framework useful? We all do various data quality reviews –We know what we’ve looked at –We may not know what we haven’t looked at A comprehensive evaluation of data quality is too resource intensive –Need to focus on aspects that matter –If needs changes, are existing DQ assessments adequate? 4

Key Features of this Presentation A comprehensive data quality framework adapted from information sciences for clinical research –Definitions of data quality Multi-dimensional Context-dependent Operationalize DQ assessments –Uses framework to ensure coverage A formative proposal – data quality meta-data tags 5

Data Quality Assessment Stages Stage 1: initial assessments of source data sets prior to analysis –Simple global analyses, visualizations, descriptive statistics –Both single-site and multi-site Stage 2: Study-specific analytic subsets with complex models and detailed data validations focused on dependent and independent variables. 6

7 A trivial example: Martial Status by Age Would this result be worrisome?

8 It’s tough being 6 years old…….

9 Should we be worried? No –Large numbers will swamp out effect of anomalous data or use trimmed data –Simulation techniques are insensitive to small errors Yes –Observed site variation may be driven by differences in data quality, not clinical practices –Genomic associations look for small signals (small differences in risks) amongst populations

Hyperkalemia with K+-sparing Agents 10

Comparative Temporal Trends: Serum Glucose 11

3. Final analytic data set Extraction from EMR Data quality assessments 1. Site level Data quality assessments Data merging 2. Multi-site level Data quality assessment lifecycles 12

Multi-site quality assessment workflows Many loops Many decisions (diamonds) 13

Data quality dimensions from the IS/CS literature Terms used in Information Sciences literature to describe data quality 14 Wand Y, Wang R. Anchoring data quality dimensions in ontological foundations. Comm ACM. 1996;39(11):86-95.

Defining data quality: The “Fit for Use” Model Borrowed from industrial quality frameworks –Juran (1951): “Fitness for Use” design, conformance, availability, safety, and field use Multiple adaptations by information science community –Not all adaptations are clearly specified –Not all adaptations are consistent –Not linked to measurement/assessment methods 15

The Wang and Strong Data Quality Model Interviews with broad set of data consumers Yielded 118 data quality features  Four categories  Fifteen dimensions Includes features of the data and the data system Our modification: Two data categories  Five dimensions 16 Wang, R. and D. Strong (1996). "Beyond accuracy: What data quality means to data consumers." J. Management Information Systems 12(4): 5-34.

17

How to measure data quality? Need to link conceptual framework with methods Maydanchik: Five classes of data quality rules –Attribute domain: validate individual values –Relational integrity: accurate relationships between tables, records and fields across multiple tables –Historical: time-vary data –State-dependent: changes follow expected transitions –Dependency: follow real-world behaviors 18 Maydanchik, A. (2007). Data quality assessment. Bradley Beach, NJ, Technics Publications.

Data Quality Assessment METHODS Five classes of data quality rules  30 assessment methods –Attribute domain rules (5 methods) –Relational integrity: (4 methods) –Historical: (9 methods) –State-dependent: (7 methods) –Dependency: (5 methods) 19 Time and change assessments dominate!!

Dimension 1: Attribute domain constraints 20

Dimension 2: Relational integrity rules 21

22

Dimension 4: State-dependent rules 23

Dimension 5:Attribute dependency rules 24

How to use this framework Determine which aspects of data quality matter most at Stage 1 –What is needed to support Stage 2 –What is doable with data sources? –What can the project afford to do? –What needs to be done once versus repeatedly Write up a data quality assessment plan –What’s in, what’s out –And why 25

Extra credit slides: A formative proposal President’s Council of Advisors on Science and Technology (PCAST) –Recommended mandatory “metadata” tags attached to all HIT data elements Metadata are descriptions of the data PCAST proposed tags: data provenance, privacy permissions/restrictions 26

27

Extra credit slides: A formative proposal CER community defines metadata tags that describe data quality for data elements and data sets –Simple distributions (mean, median, min, max, missingness, histograms) ala OMOP OSCAR –More comprehensive set of measures derived from this framework If you are interested in this concept, contact me! 28

Funding was provided by a contract from AcademyHealth. Additional support was provided by AHRQ 1R01HS (Scalable PArtnering Network for CER: Across Lifespan, Conditions, and Settings), AHRQ 1R01HS (Scalable Architecture for Federated Translational Inquiries Network), and NIH/NCRR Colorado CTSI Grant Number UL1 RR (Colorado Clinical and Translational Sciences Institute). A Pragmatic Model for Data Quality Assessment in Clinical Research Michael G. Kahn, M.D., Ph.D.