Could statistical science have caught Harold Shipman earlier?

Slides:



Advertisements
Similar presentations
The Controlled Drugs (Supervision of Management and Use) Regulations (Northern Ireland) 2009.
Advertisements

Designing Experiments: Sample Size and Statistical Power Larry Leamy Department of Biology University of North Carolina at Charlotte Charlotte, NC
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
1 Health Warning! All may not be what it seems! These examples demonstrate both the importance of graphing data before analysing it and the effect of outliers.
Risk adjustment and other reporting issues Shalini Santhakumaran NDAU Statistician Imperial College London.
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Hypothesis Testing making decisions using sample data.
Departments of Medicine and Biostatistics
ODAC May 3, Subgroup Analyses in Clinical Trials Stephen L George, PhD Department of Biostatistics and Bioinformatics Duke University Medical Center.
Unintentional Fall Injuries and Deaths Among MA Older Adults, Ages 65 Years and Over Carrie Huisingh, MPH, Epidemiologist Holly Hackman, MD, MPH, Epidemiologist.
Find the Joy in Stats ? ! ? Walt Senterfitt, Ph.D., PWA Los Angeles County Department of Public Health and CHAMP.
Bayesian insights into health-care performance monitoring in the UK David Spiegelhalter MRC Biostatistics Unit, Cambridge Thanks to Nicky Best, Clare Marshall,
Opportunities for Bayesian analysis in evaluation of health-care interventions David Spiegelhalter MRC Biostatistics Unit Cambridge
Introduction to Hypothesis Testing
Chapter 11: Sequential Clinical Trials Descriptive Exploratory Experimental Describe Find Cause Populations Relationships and Effect Sequential Clinical.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
West Hertfordshire Hospitals NHS Trust Analysing mortality data derived from Secondary User Services.
Sample Size Determination
Type II Error, Power and Sample Size Calculations
Power and Non-Inferiority Richard L. Amdur, Ph.D. Chief, Biostatistics & Data Management Core, DC VAMC Assistant Professor, Depts. of Psychiatry & Surgery.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Measuring the Quality of Hospital Care Min Hua Jen Imperial College London.
BIOE 301 Lecture Seventeen. Guest Speaker Jay Brollier World Camp Malawi.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Primary Care Trigger Tool Manaia Health PHO Linda Holman Quality Leader.
Outcomes surveillance using routinely collected health data Paul Aylin Professor of Epidemiology and Public Health Dr Foster Unit at Imperial College London.
Office for National Statistics & Public Health Dr Azeem Majeed Office for National Statistics & University College London.
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Analysis and presentation of quality indicators
Assignment # 2. Problem Set-up  Clearly describe the link between the biological hypothesis, the statistical hypothesis and the data collected.
The Shipman Inquiry Shipman 4 - Implications for Pharmacy: Controlled Drugs Presentation to the British Pharmaceutical Conference 2005 By Mandie Lavin.
1 The Use of Control Charts in Health Care Monitoring and Public Health Surveillance William H. Woodall William H. Woodall Department of Statistics Department.
Multiple Testing Matthew Kowgier. Multiple Testing In statistics, the multiple comparisons/testing problem occurs when one considers a set of statistical.
August 20, 2003FDA Antiviral Drugs Advisory Committee Meeting 1 Statistical Considerations for Topical Microbicide Phase 2 and 3 Trial Designs: A Regulatory.
Issues concerning the interpretation of statistical significance tests.
Community Outreach to Reduce Disparities in Cardiovascular & Diabetes Morbidity & Mortality in the South Bronx Michael Alderman, MD Michelle Johnson, MD,
© Imperial College LondonPage 1 A method for estimating the cost of reducing the false alarm rate in multi- institution performance monitoring using CUSUM.
KJ Carroll Dec 1 st 2009 Back to Basics: Explaining sample size in outcome trials, are statisticians doing a thorough job? Kevin J Carroll.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Postgraduate books recommended by Degree Management and Postgraduate Education Bureau, Ministry of Education Medical Statistics (the 2nd edition) 孙振球 主.
1 SMU EMIS 7364 NTU TO-570-N Control Charts Basic Concepts and Mathematical Basis Updated: 3/2/04 Statistical Quality Control Dr. Jerrell T. Stracener,
“Dr. Death” “The Angel of Death” “The police complain I’m boring. No mistresses, home abroad, money in Swiss banks; I’m normal. If that is boring, I am.”
THE ROLE OF SUBGROUPS IN CLINICAL TRIALS Ralph B. D’Agostino, Sr., PhD Boston University September 13, 2005.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Professor Peter Furness National Medical Examiner Past President, Royal College of Pathologists.
INTRODUCTION TO TESTING OF HYPOTHESIS INTRODUCTION TO TESTING OF HYPOTHESIS SHWETA MOGRE.
National Confidential Inquiry into Suicide and Homicide by People with Mental Illness Making Mental Health Care Safer Annual Report and 20 year Review.
Descriptive study design
The (possible) Future of Autopsy
Statistical Core Didactic
Interventional trials
Health Information Management A Case Study
Sample Size Estimation
Mortality and harm – Developing Board Assurance
Chapter 8: Inference for Proportions
Flu epidemiology in Scotland – season 2017/18
R. E. Wyllys Copyright 2003 by R. E. Wyllys Last revised 2003 Jan 15
Is Alcohol a Problem? -Setting the Scene
Cumulative sum techniques for assessing surgical results
Analysis of Variance ANOVA.
Late Thrombosis in Drug Eluting Stents (DES)
Acquiring Proficiency in Off-Pump Surgery: Traversing the Learning Curve, Reproducibility, and Quality Control  Gavin J. Murphy, MD, Chris A. Rogers,
Volume 373, Issue 9672, Pages (April 2009)
Chapter 3: Graphic Presentation
Joint Commissioning Strategy for Learning Disabilities 2019 – 2024 LeDeR Learning Disability Review of Mortality Learning for Change Jan Gates Tracey.
Consultant Clinical Biochemist
Chris A. Rogers, PhD, Barnaby C. Reeves, DPhil, Massimo Caputo, MD, J
Presentation transcript:

Could statistical science have caught Harold Shipman earlier? David Spiegelhalter MRC Biostatistics Unit Cambridge BA Festival of Science, September 6th 2005

Shipman background 1974: General Practitioner in Todmorden 1975: Left when convicted of drug offences 1977: Joins group practice in Hyde 1992: Sets up in single-handed practice Jan 1998: Positive report from Tameside audit group: ‘Keep up the good work!’

March 1998: local GP contacts coroner with suspicions July 1998: police contacted concerning a possibly forged will Sept 1998: Shipman arrested. Enquiries made concerning 192 deaths. Jan 2000: convicted of murdering 15 patients 2001: Report from Richard Baker for DoH 2001: Public Inquiry examines 401 deaths 2004: Shipman commits suicide in prison

Shipman Inquiry July 2002: 215 definite victims 45 probable

First comparative analyses by Baker compares - ‘Observed’ deaths: those at home or in his practice and certified by Shipman, with ‘Expected’ deaths: corresponding rates for (small sample of) local GPs.

(NB: Shipman Inquiry total of definite or probable victims: 189 female > 65, 55 male over 65)

Sequential monitoring Accumulates evidence of ‘excess risk’ Simplest method is a cumulative plot of Observed - Expected (O – E) Good intuitive plot But when should we ‘blow the whistle’?

The Bristol Inquiry into excess mortality

But when to sound an alarm?

How soon can we tell if we are ‘off-target’?

Meanwhile, back in 1943 A group of statisticians called SR17 were working in the Ministry of Supply Included Barnard, Lindley, Plackett, Armitage etc Barnard developed the `Sequential probability ratio test’ Simultaneously discovered by Abraham Wald in the US George Barnard later wrote British Standard 3704 for condoms

Sequential probability ratio test (SPRT) Most powerful sequential test between two hypotheses H0 and H1 Based on log(likelihood ratio) LLR = log [ p(data| H1) / p(data| H0) ] Straightforward to adapt to ‘risk-adjustment’ model Horizontal thresholds set by required error rates

Risk-adjusted SPRT for count data Suppose observe O adverse events E are expected under H0 Alternative hypothesis H1: risk ratio r Contribution to log(likelihood ratio) is LLR = O log r - ( r -1) E Suppose we want to detect a doubling of risk (r = 2), then LLR = O log 2 - E i.e. plot 0.69 O – E , rather than O - E

For Bristol data

Setting thresholds = Probability of eventually rejecting H0 when it is true (Type I error) = Probability of eventually rejecting H1 when it is true (Type II error). Upper threshold: a = log [  / ( 1-  ) ] Lower threshold: b = log [ (1-) /  ]

Using Cardiac Surgical Register data (CSR), crossed 0.001 threshold in 1991

What threshold to choose? For a single series, perhaps set =  = 0.1 for ‘alert’, =  = 0.01 for ‘alarm’ ? But what if we are monitoring many GPs? Greatly increases risk of one GP crossing threshold by chance alone – problem of ‘multiple testing’ There are 27,000 GPs in England Perhaps use =  = 0.000001 ?

Shipman: older females would have set off ‘alarm’ in 1985, after only 40 deaths

But is this an appropriate comparison? Excess of home death certificates could be due to a good terminal care programme by a caring GP

Second comparative analyses by Baker O = All deaths in Shipman’s practice : ‘Expecteds’ from ONS data. Special linkage required as deaths not routinely linked to general practice. But overall death rates will only be changed by substantial reductions in length of life

Problems with this analysis? SPRT ‘built up credit’ while Shipman’s performance seemed reasonable Overall death rates will only be changed by substantial reductions in length of life, not by killing terminally ill patients

How might monitoring work in practice, using all death certificates? Group at Imperial College commissioned by Shipman Inquiry Mortality data from 1993 on 1000 GPs Very limited age-sex adjustment Huge variation in mortality rates found among GPs ( multiplicative over-dispersion model ) Risk-adjusted CUSUMs used (like SPRT but avoids problem of ‘building up credit’)

‘Signalling’ GPs were subsequently investigated

So what is happening now? “The Department of Health must take the lead in developing a national system for monitoring GP patient mortality rates” “It is disappointing that the arrangements for the monitoring and analysis of mortality data that were envisaged at the time of the Inquiry seminar have not progressed” Dame Janet Smith – Fifth report, Shipman Inquiry

References P Aylin, NB Best, A Bottle, EC Marshall EC (2003) Following Shipman: a pilot system for monitoring mortality rates in primary care. Lancet 362, 485-491 R Baker. Harold Shipman’s clinical practice 1974-1998: a Review Commissioned by the Chief Medical Officer. London: The Stationery Office, 2001. The Shipman Inquiry. www.the-shipman-inquiry.org.uk/ DJ Spiegelhalter, R Kinsman, O Grigg and T Treasure. (2003) Risk-adjusted sequential probability ratio tests: applications to Bristol, Shipman, and adult cardiac surgery. International Journal for Quality in Health Care 15:7–13 DJ Spiegelhalter and NG Best (2004) Harold Shipman’s statistical legacy. Significance, 1, 10-12 SH Steiner, RJ Cook, VT Farewell, T Treasure (2000) Monitoring surgical performance using risk-adjusted cumulative sum charts. Biostatistics 1: 441-552