Evaluating the Use of Outbreak Detection Algorithms to Detect Tuberculosis Outbreaks in Scotland Ben Tait Dr Janet Stevenson.

Slides:



Advertisements
Similar presentations
In the name of GOD In the name of GOD.
Advertisements

Why Do You Need to Measure Both BER and MER on Digital Signals
1 Health Warning! All may not be what it seems! These examples demonstrate both the importance of graphing data before analysing it and the effect of outliers.
EVAL 6970: Meta-Analysis Vote Counting, The Sign Test, Power, Publication Bias, and Outliers Dr. Chris L. S. Coryn Spring 2011.
Hawawini & VialletChapter 7© 2007 Thomson South-Western Chapter 7 ALTERNATIVES TO THE NET PRESENT VALUE RULE.
Contents 7.2 Applications of Percentage in Banking and 7.3 Time-series Graphs 7.4 Interpreting and Analysing Data Collected 7.1 Arithmetic and Geometric.
Signaling & Network Control Dr. Eng. Amr T. Abdel-Hamid NETW 704 Winter 2008 Message Transfer Part 2 (MTP2)
Characterisation and Reliability Testing of THz Schottky Diodes Chris Price University of Birmingham, UK
Introduction to Survival Analysis PROC LIFETEST and Survival Curves.
Internal Quality Control (QC) for Medical Laboratories: An introduction Dr. Otto Panagiotakis and Dr. Alexander Haliassos ESEAP – Greek Proficiency Testing.
Tuberculosis in the UK 2013 report
Tuberculosis in the UK 2014 report
Quality management: SPC III – control charts for attributes
Chemometrics Method comparison
Diane Stockton Trend analysis. Introduction Why do we want to look at trends over time? –To see how things have changed What is the information used for?
The Spatial Scan Statistic. Null Hypothesis The risk of disease is the same in all parts of the map.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 7: Gathering Evidence for Practice.
Portfolio Management Lecture: 26 Course Code: MBF702.
Wisconsin Department of Health Services HIV/AIDS Surveillance Annual Review New diagnoses, prevalent cases, and deaths through December 31, 2013 April.
Confidentiality and Security Issues in ART & MTCT Clinical Monitoring Systems Meade Morgan and Xen Santas Informatics Team Surveillance and Infrastructure.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
Investigating the Accuracy and Robustness of the Icelandic Cod Assessment and Catch Control Rule A. Rosenberg, G. Kirkwood, M. Mangel, S. Hill and G. Parkes.
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 A Feature Selection and Evaluation Scheme for Computer Virus Detection Olivier Henchiri and Nathalie Japkowicz School of Information Technology and Engineering.
Grobman, K. H. "Confirmation Bias." Teaching about. Developmentalpsychology.org, Web. 16 Sept Sequence Fits the instructor's Rule? Guess.
Use of Public Health Intelligence for Disease Outbreaks (PHIDO) to Enhance Provincial Routine Reportable Disease Surveillance in Manitoba Weimin Hu, MBBS.
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
Chapter 4 Control Charts for Measurements with Subgrouping (for One Variable)
Accuracy Based Generation of Thermodynamic Properties for Light Water in RELAP5-3D 2010 IRUG Meeting Cliff Davis.
Time series Model assessment. Tourist arrivals to NZ Period is quarterly.
Current Challenges and Future Developments in HTA in the UK Frances Macdonald, 23 rd September 2008 (A personal, Industry View)
TREATMENT TIME GUARANTEE M Brown May Aims of the session TTG and its Waiting Times context Measurement –Reasonable offer –Impact of Board decisions.
Materials and Methods GIS Development A GIS was constructed from historical records of known villages reporting human anthrax between the years 1937 and.
Issues concerning the interpretation of statistical significance tests.
1 Comparing multiple tests for separating populations Juliet Popper Shaffer Paper presented at the Fifth International Conference on Multiple Comparisons,
1 Basic epidemiological study designs and its role in measuring disease exposure association M. A. Yushuf Sharker Assistant Scientist Center for Communicable.
© Imperial College LondonPage 1 A method for estimating the cost of reducing the false alarm rate in multi- institution performance monitoring using CUSUM.
Chapter 8: Simple Linear Regression Yang Zhenlin.
INVESTIGATION of EPIDEMIC. LEARNING OBJECTIVES  Recognize trends of disease occurrence.  Recognize trends of disease occurrence.  Define epidemic and.
Statistical Tests We propose a novel test that takes into account both the genes conserved in all three regions ( x 123 ) and in only pairs of regions.
Quantifying efficiency of homogenisation methods Dr. Peter Domonkos COST HOME ES0601.
Session 5: Review of the Scientific Method and The ATS Development Process 1.
MODEL DIAGNOSTICS By Eni Sumarminingsih, Ssi, MM.
Copyright © 2008 Delmar. All rights reserved. Chapter 4 Epidemiology and Public Health Nursing.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Mingze Zhang, Mun Choon Chan and A. L. Ananda School of Computing
Understanding Epidemiology
DISCUSSION AND CONCLUSIONS
National Center for HIV/AIDS, Viral Hepatitis, STD, and TB Prevention
Polynomial and Rational Functions
Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study  Dr Timothy M Walker, MRCP, Camilla LC.
One-Way Analysis of Variance
Public Health Surveillance
Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations Greenland et al (2016)
Epidemiology MPH 531 Analytic Epidemiology Case control studies
Hanford Performance Indicator Forum Statistical Process Control June 2005 Steven S Prevette
Honors Statistics Review Chapters 4 - 5
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
Interpreting Epidemiologic Results.
Student: Mallesham Dasari Faculty Advisor: Dr. Maggie Cheng
Using Whole Genome Sequencing Analysis in California
Quality Assessment The goal of laboratory analysis is to provide the accurate, reliable and timeliness result Quality assurance The overall program that.
Perspectives from Los Angeles County Tuberculosis Control Program
Backtesting.
Lab Results + RVCT Notification=
Presumptions Subgroups (samples) of data are formed.
Regulatory-Industry Statistics Workshop , 2019
Session 2.1: Geography and time across the HIS
Presentation transcript:

Evaluating the Use of Outbreak Detection Algorithms to Detect Tuberculosis Outbreaks in Scotland Ben Tait Dr Janet Stevenson

To use outbreak detection algorithms on Scottish data on Tuberculosis infections reported to the Enhanced Surveillance of Mycobacterial Infections (ESMI) scheme to retrospectively detect outbreaks in order to see if this method could be a useful adjunct to the current methods of contact tracing and tuberculosis genotyping. Aim

Cluster Selection

Between 2007 and 2011 there were 151 clusters of identical genotypes in Scotland, of which 51.6% only involved two individuals After analysing the size of clusters of TB across Scotland a minimum cluster size of four individuals was used. The largest clusters were chosen from each remaining health board, with the next three largest clusters also selected from the Greater Glasgow and Clyde health board. The inclusion of the additional clusters in NHS Greater Glasgow and Clyde was due to the large imbalance in the location of clusters, 10.4cases/100,00 compared to 5.5cases/100,000 in Lothian.

Clusters

Case Count For each cluster, the number of cases of TB with the cluster genotype, that were in the main health board, were calculated for each of a series of 37 overlapping two year time frames which moved forward by one month. Additionally the number of TB cases with the cluster genotype outside of the main health board was also calculated for each time frame. A graph was then conducted showing the number of TB cases in each time frame throughout the five years allowing comparison between cases within the health board and cases outwith the health board.

Case Count

Log Likelihood Ratio The higher the log likelihood ratio, the higher the likelihood that there is an unexpected concentration of the disease or genotyped cluster of TB in question. The ratio compares the rate of the genotype within an area (the number of cases of a specific genotype compared to the total cases), compared to the rate of genotype outside the geographical area. Uses a threshold value to signify an outbreak Despite the drawbacks this method has been shown to be effective, especially on diseases such as TB which is transmitted in small local groups. This method also has additional benefits, as the data is collected via health boards, which simplifies the data analysis and does not rely on historical data.

Log Likelihood Ratio

Case Count

Log Likelihood Ratio Conclusions The Log Likelihood Ratio detected three outbreaks amongst the largest clusters in Scotland. This ratio is based upon the fact that an outbreak will happen within a single health board. As the LLR depends on the number of TB cases of different genotypes, it takes a larger outbreak for the LLR to cross the threshold in Glasgow than it does in Highland or Grampian, as was the case in clusters C and D compared to cluster H. The LLR is useful when a genotype is uncommon outside the health board or the cases occur in a health board with few TB cases. This property is useful to avoid any false alarms, but does mean that LLR is less sensitive in the larger health boards.

CuSum Originally developed for detecting changes in industrial processes and is one of the most commonly used Statistical Process Control (SPC) methods in the surveillance of syndromic data. Primarily used to detect changes in disease trends earlier than they may be detected by simple case count observation. Uses historical data to estimate an average rate and then detects when the current rate rises over what is to be expected by normal fluctuating presence of disease. Concern that CuSums are prone to providing producing false positives especially in the case of rare diseases.

CuSum

Case Count

CuSum Conclusions Ideally the case numbers at the start should be either zero, or at the very least below the threshold level. This was not possible for most of the clusters data that was used. Nevertheless, the CuSum identified six outbreaks amongst the eight clusters; this higher level of sensitivity may be beneficial in the detection of future outbreaks. One of the ways that CuSum differs from Log likelihood ratios is that it does not take into account the cases outwith the health board. The parameters that were used in this study were derived from the Kammerer study Due to the time required for a culture to be confirmed, and then for a MIRU profile to be added this may affect the timeliness of the CuSum and other outbreak detection algorithms

Conclusions Comparing CuSum and Log Likelihood Ratio, we can see that CuSum is the more sensitive of the two outbreak detection algorithms. The Log Likelihood Ratio based upon the health board areas may be suitable in other situations but it does not appear to be able to detect rises in genotype clusters in the larger health boards nor does it detect rises when the genotype is common across multiple health boards. Using the more sensitive CuSum there is a higher risk of false alarms, although it was not possible to test this. Both CuSum and LLR have detected outbreaks amongst the selected clusters, however this can also be done by simply observing a case count graph, comparing numbers with a genotype inside a health board to those outwith a health board. Although neither of the statistical methods were completely successful the method of case counting could be a useful adjunct to monitoring if outbreaks are being brought under control appropriately

Future Work All studies found during the literature search applied outbreak detection algorithms retrospectively. However, a prospective study is currently being carried out by the CDC using these methods on tuberculosis data as cases are reported to the CDC. More research will be needed on this to fully determine whether these outbreak detection algorithms would indeed be useful in real life situations. Further work in evaluating more geo-spatial analysis which does not require the arbitrary boundaries of the health boards.

Acknowledgements This could not have been possible without the help of Alison Smith-Palmer and Dr Ian Laurenson