Download presentation
Presentation is loading. Please wait.
Published byKelly Daniel Modified over 8 years ago
1
The Clinical Practice Research Datalink Methodological Challenges in using Routine Clinical Data Dr Alison Nightingale, University of Bath
2
What is the CPRD and how are data generated? Database challenges Methodological challenges caused by missing data Overview
3
Clinical Practice Research Datalink
5
15.6 million patients from 684 general practices since 1988 5,773,422,644 records in 10 separate tables 6TB storage currently required Big Data? 1,605,948,604 consultations 903,530,787 tests 1,423,921,076 prescriptions
6
Data loads Downloading datasets from CPRD online Time taken to run complex algorithms Analysis of datasets where the study population is large Processing and analysis challenges
7
Methodological challenges in using the CPRD
8
Data generated through routine clinical care Missing data: Data are censored to the left and right Symptoms / diagnoses / results that are not clinically relevant Characteristics of patients or tests that are ‘normal’ ‘Take as directed’ prescriptions Confirmation or exclusion of diagnoses CPRD: missing data
9
Incidence Prevalence Mortality Risk factors Disease epidemiology
10
Incidence Prevalence Mortality Risk factors Disease epidemiology
11
Prevalence = incidence * average disease duration Increasing incidence rates Awareness Changes in diagnostic criteria Decreasing mortality (increase in disease duration) Improved treatment Earlier diagnosis Artefact? : misclassification / relapsing-remitting disease Increasing prevalence
12
What is the truth? Adjustment? Disease epidemiology
13
Study to estimate the diagnostic accuracy of RF testing in primary care for the diagnosis of rheumatoid arthritis. Investigate the impact of negative RF tests on time to diagnosis of RA Accuracy of diagnostic tests Disease Test result PositiveNegative PositiveTPFP NegativeFNTN
14
Identified potential issue of missing data due to lack of clinical significance of a negative test Suspected the data were MNAR = issue with imputing Accuracy of diagnostic tests Disease Test result PositiveNegative PositiveTPFP NegativeFN (Seronegative) TN
15
Practice-year stratified data % tests with ‘null’ result value % positive test results Results from RF tests submitted to labs in Oxford indicated that on average 7% were positive. Maximum positive rate was 23% from a rheumatology clinic
16
Practice-year stratified data % tests with ‘null’ result value % positive test results MNAR Reference range MNAR Tests not recorded -ve results not recorded
17
Practice-year stratified data % tests with ‘null’ result value % positive test results MNAR Reference range MNAR Tests not recorded -ve results not recorded
18
Practice-year stratified data % tests with ‘null’ result value % positive test results MNAR Reference range MNAR Tests not recorded -ve results not recorded
19
What do we do with MNAR data? % tests with ‘null’ result value % positive test results MNAR Reference range MNAR Tests not recorded -ve results not recorded Tests and test results more likely to be MAR
20
No current statistical methods of identifying MNAR data Reference values not always available No current consensus on how to handle data that are MNAR Likely to be present throughout: Prescribing records – missing prescribing information Medical diagnoses – missing diagnoses Patient characteristics: smoking, alcohol, body mass index Data that are likely to be MNAR
21
Large population Cohort and nested case-control studies Post-marketing safety surveillance: adverse outcomes Comparative effectiveness: treatment outcomes Methodological issues surrounding timing of exposures and the use of cumulative drug exposures Confounding by indication Comparative effectiveness & safety
22
Professor Neil McHugh, Professor of Pharmacopidemiology and Consultant Rheumatologist Dr Gavin Shaddick, Reader in Statistics Dr Anita McGrogan, Lecturer in Pharmacoepidemiology Dr Rachel Charlton, Research Fellow Dr Alison Nightingale, Research Fellow Julia Snowball, Database manager and programmer Amelia Jobling, Research Assistant and PhD student (statistics) CPRD team at Bath
23
Thank you a.nightingale@bath.ac.uk
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.