10 May 2010: Approaches to test evaluation. Evan Sergeant, AusVet Animal Health Services.

Presentation transcript:


Comparing tests
• Kappa – how well tests agree
• McNemar's chi-sq – are tests significantly different?

Kappa
• Expected no. both +ve = (157 x 155)/1122 = 21.7
• Expected no. both -ve = (965 x 967)/1122 = 831.7
• Total agreement = 1052
• Chance agreement = 21.7 + 831.7 = 853.4
• K = (1052 - 853.4)/(1122 - 853.4) = 0.739
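As a rough check of the arithmetic on this slide, the sketch below computes Cohen's kappa from a 2x2 table in Python. The cell counts (121, 36, 34, 931) do not appear on the slide; they are back-calculated here from the marginal totals (157, 155, 965, 967), the total agreement (1052) and N = 1122, so treat them as an assumption. The function name `cohen_kappa` is illustrative only.

```python
def cohen_kappa(a, b, c, d):
    """Cohen's kappa for a 2x2 table of paired test results.

    a = both tests +ve, b = test 1 +ve / test 2 -ve,
    c = test 1 -ve / test 2 +ve, d = both tests -ve.
    """
    n = a + b + c + d
    observed = a + d                          # total agreement
    expected_pos = (a + b) * (a + c) / n      # chance agreement, both +ve
    expected_neg = (c + d) * (b + d) / n      # chance agreement, both -ve
    chance = expected_pos + expected_neg
    return (observed - chance) / (n - chance)


# Cell counts back-calculated from the slide's totals (an assumption, see above).
kappa = cohen_kappa(121, 36, 34, 931)
print(round(kappa, 3))
```

With these counts the result agrees with the slide's K to three decimal places.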

McNemar Chi-Squared
McNemar's Chi-squared test with continuity correction
McNemar's chi-squared = , df = 1, p-value = 1.724e-06
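To make the statistic concrete, here is a minimal Python sketch of McNemar's chi-squared with continuity correction, which depends only on the two discordant cells of the paired 2x2 table. The counts used (40 and 8) are hypothetical and are not the data behind the R output on the slide; `mcnemar_cc` is an illustrative name.

```python
import math


def mcnemar_cc(b, c):
    """McNemar's chi-squared with continuity correction (df = 1).

    b and c are the two discordant cells of the paired 2x2 table.
    """
    stat = (abs(b - c) - 1) ** 2 / (b + c)
    # Upper tail of a chi-squared distribution with 1 df:
    # P(X > x) = erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical discordant counts, not taken from the slides.
stat, p_value = mcnemar_cc(40, 8)
print(f"chi-squared = {stat:.2f}, df = 1, p = {p_value:.2e}")
```

A small p-value suggests the two tests differ in the proportion they classify as positive; kappa answers the separate question of how well they agree.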

OJD AGID and ELISA
2x2 table of AGID by ELISA results (+ve, –ve, Total); cell counts not preserved.
• Enter data into EpiTools: Application of diagnostic tests > Compare 2 tests
• See kappa, McNemar's and level of agreement

Statistic                        Value
Kappa
SE for kappa =
Z(kappa)                         8.25
p(kappa) - one-tailed            0
Proportion positive agreement
Proportion negative agreement
Overall proportion agreement
McNemar's Chi sq
p(Chi sq)                        0.4

Gold Standard Tests
• Use tests with perfect sensitivity and/or specificity to identify the true disease status of the individual from which the samples were taken.
• What are the advantages and disadvantages of this approach?

Gold Standard Tests
• Advantages
  - Known disease status
  - Relatively simple calculations
• Disadvantages
  - May not exist, or be prohibitively expensive
  - Rare diseases may only allow small sample size
  - Disease may not be present in the country
  - Difficult to get representative (or even comparable) samples of diseased/non-diseased individuals

Exercises
• Calculate Se and Sp for OJD AGID using data provided in OJD_AGID_Data.xls
• Calculate confidence limits using EpiTools

Non-gold standard methods
• Do not depend on determining true infection status of individual.
• Rely on statistical approaches to calculate best fit values for Se and Sp.
• Tests must satisfy some important assumptions.

Comparison with a known reference test
• Assumptions
  - Independence of tests
  - Se/Sp of reference test is known
• For ~100% specific reference test:
  Se(new test) = Number positive to both tests / Total number positive to the reference test

Culture vs Serology
• Estimate sensitivity of culture and serology (as flock tests)
• Serology followed up by histopathology to confirm flock status
• Both tests 100% specificity (as flock tests)
• How would you estimate sensitivity for these test(s)?
• Which test has better Se? Is the difference significant?

All flocks: 2x2 table of PFC (rows: +ve, -ve, Total) by Serology (columns: +ve, -ve, Total); cell counts not preserved.

Example
• Se (PFC) = 58/63 = 92% (83% - 97%)
• Se (Serology) = 58/95 = 61% (51% - 70%)

Statistic                        Value
Kappa
SE for kappa =
Z(kappa)                         11.49
p(kappa) - one-tailed            0
Proportion positive agreement
Proportion negative agreement
Overall proportion agreement
McNemar's Chi sq
p(Chi sq)                        0
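The sensitivity estimates and their confidence limits can be reproduced along the following lines. This Python sketch uses the Wilson score interval, which is one common choice for a binomial proportion; the slides do not say which method produced the limits shown, so the choice of method (and the name `wilson_interval`) is an assumption.

```python
import math


def wilson_interval(x, n, z=1.96):
    """Wilson score interval for a proportion x/n (z = 1.96 for ~95%)."""
    p = x / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half, centre + half


# Counts from the slide: positive to both tests / positive to the reference test.
for name, x, n in [("PFC", 58, 63), ("Serology", 58, 95)]:
    lo, hi = wilson_interval(x, n)
    print(f"Se ({name}) = {x / n:.0%} ({lo:.0%} - {hi:.0%})")
```

For these two counts the Wilson limits round to the same percentages as those quoted on the slide.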

Estimation from routine testing data
• Test-positives are subject to follow-up, and truly infected animals are identified and removed from the population.
• Can be used to estimate specificity when the disease is rare in the population of interest.
• Sp = 1 – (Number of reactors / Total number tested)

Se and Sp of equine influenza ELISA
• During the equine influenza outbreak in Australia, horses were tested by PCR and serology:
  - to confirm infection;
  - to demonstrate seroconversion and/or absence of infection >30 days later;
  - as part of random and targeted surveillance for case detection, to confirm area status and for zone progression in presumed "EI free" areas.
• How could you use the resulting data to estimate sensitivity and specificity of the ELISA?

Equine influenza ELISA
• 475 PCR-positive horses, 471 also positive on ELISA
• 1323 horses from properties in areas with no infection, 1280 ELISA negative
• Analyse in EpiTools: Application of diagnostic tests > Test evaluation against gold standard
• Sergeant, E. S. G., Kirkland, P. D. & Cowled, B. D. Field evaluation of an equine influenza ELISA used in New South Wales during the 2007 Australian outbreak response. Preventive Veterinary Medicine, 92,

               Point estimate   Lower 95% CL   Upper 95% CL
Sensitivity
Specificity
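The point estimates follow directly from the counts quoted for the equine influenza ELISA. The Python sketch below treats PCR-positive horses as truly infected and horses from infection-free areas as truly uninfected, as the exercise does; the helper names are illustrative. It also shows that the routine-testing formula, Sp = 1 – (reactors / total tested), gives the same specificity.

```python
def sensitivity(test_pos_infected, n_infected):
    """Proportion of truly infected animals that test positive."""
    return test_pos_infected / n_infected


def specificity(test_neg_uninfected, n_uninfected):
    """Proportion of truly uninfected animals that test negative."""
    return test_neg_uninfected / n_uninfected


se = sensitivity(471, 475)        # ELISA +ve among PCR-positive horses
sp = specificity(1280, 1323)      # ELISA -ve among horses from infection-free areas

# Same specificity via the routine-testing formula: 1 - reactors / total tested
reactors = 1323 - 1280
sp_routine = 1 - reactors / 1323

print(f"Se = {se:.4f}, Sp = {sp:.4f}, Sp (routine formula) = {sp_routine:.4f}")
```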

Mixture modelling
• Assumptions
  - The observed distribution of test results (for a test with a continuous outcome reading, such as an ELISA) is actually a mixture of two frequency distributions, one for infected individuals and one for uninfected individuals.
• Opsteegh, M., Teunis, P., Mensink, M., Zuchner, L., Titilincu, A., Langelaar, M. & van der Giessen, J. Evaluation of ELISA test characteristics and estimation of Toxoplasma gondii seroprevalence in Dutch sheep using mixture models. Preventive Veterinary Medicine.
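The idea can be illustrated with a small Python sketch that fits a mixture of two normal distributions by expectation-maximisation (EM) and then reads Se and Sp off the fitted components at a chosen cut-off. Everything here is an assumption for illustration: the readings are simulated, the components are assumed normal, the cut-off of 0.5 is arbitrary, and this is not the model used in the cited paper.

```python
import math
import random


def normal_pdf(x, mu, sd):
    return math.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * math.sqrt(2 * math.pi))


def normal_cdf(x, mu, sd):
    return 0.5 * (1 + math.erf((x - mu) / (sd * math.sqrt(2))))


def fit_two_normals(values, n_iter=200):
    """EM fit of a two-component normal mixture.

    Returns [(weight, mean, sd)] for the low and the high component.
    """
    mu = [min(values), max(values)]                 # crude starting values
    sd = [(max(values) - min(values)) / 4] * 2
    w = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: probability that each reading belongs to the high component
        resp = []
        for x in values:
            p0 = w[0] * normal_pdf(x, mu[0], sd[0])
            p1 = w[1] * normal_pdf(x, mu[1], sd[1])
            resp.append(p1 / (p0 + p1))
        # M-step: weighted updates of weight, mean and sd
        for k in (0, 1):
            r = [q if k == 1 else 1 - q for q in resp]
            total = sum(r)
            w[k] = total / len(values)
            mu[k] = sum(ri * x for ri, x in zip(r, values)) / total
            var = sum(ri * (x - mu[k]) ** 2 for ri, x in zip(r, values)) / total
            sd[k] = max(math.sqrt(var), 1e-6)       # guard against collapse
    return [(w[k], mu[k], sd[k]) for k in (0, 1)]


# Simulated ELISA-like readings (hypothetical): 300 uninfected, 100 infected.
rng = random.Random(1)
readings = [rng.gauss(0.2, 0.05) for _ in range(300)]
readings += [rng.gauss(1.0, 0.2) for _ in range(100)]

(w0, mu0, sd0), (w1, mu1, sd1) = fit_two_normals(readings)
cutoff = 0.5                                        # arbitrary illustrative cut-off
sp_est = normal_cdf(cutoff, mu0, sd0)               # uninfected below cut-off
se_est = 1 - normal_cdf(cutoff, mu1, sd1)           # infected above cut-off
print(f"prevalence ~ {w1:.2f}, Se ~ {se_est:.3f}, Sp ~ {sp_est:.3f}")
```

The weight of the high component doubles as an estimate of seroprevalence, which is why mixture models can estimate test characteristics and prevalence together.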

Latent Class Analysis
• What is Latent Class Analysis?
• Maximum Likelihood
• Bayesian

Maximum likelihood estimation
• Assumptions
  - The tests are independent conditional on disease status (the sensitivity [specificity] of one test is the same, regardless of the result of the other test);
  - The tests are compared in two or more populations with different prevalence between populations;
  - Test sensitivity and specificity are constant across populations; and
  - There are at least as many populations as there are tests being evaluated.
• TAGS software
• Hui, S. L. & Walter, S. D. Estimating the error rates of diagnostic tests. Biometrics, 36,
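Under these assumptions the two-test, two-population model can be fitted with an EM algorithm, sketched below in Python. This is only one way to maximise the likelihood and is not the TAGS code. The counts are hypothetical expected values generated from known parameters, so that recovery of those parameters can be checked. Starting values of 0.9 for every Se and Sp are an assumption that steers the fit away from the mirror-image solution (Se and Sp both below 0.5).

```python
PATTERNS = [(1, 1), (1, 0), (0, 1), (0, 0)]   # (test 1, test 2); 1 = positive


def pattern_probs(prev, se, sp):
    """Joint probability of each pattern with infected / uninfected status,
    assuming the tests are independent conditional on status."""
    out = []
    for pattern in PATTERNS:
        p_inf, p_un = prev, 1 - prev
        for j, r in enumerate(pattern):
            p_inf *= se[j] if r else 1 - se[j]
            p_un *= 1 - sp[j] if r else sp[j]
        out.append((p_inf, p_un))
    return out


def hui_walter_em(tables, n_iter=2000):
    """tables: per population, counts in PATTERNS order. Returns (prev, se, sp)."""
    se, sp = [0.9, 0.9], [0.9, 0.9]            # starting guesses
    prev = [0.5] * len(tables)
    for _ in range(n_iter):
        inf_by_pop = [0.0] * len(tables)
        inf_pos, un_neg = [0.0, 0.0], [0.0, 0.0]
        for k, counts in enumerate(tables):
            probs = pattern_probs(prev[k], se, sp)
            for pattern, n, (p_inf, p_un) in zip(PATTERNS, counts, probs):
                w = p_inf / (p_inf + p_un)     # E-step: P(infected | pattern)
                inf_by_pop[k] += n * w
                for j, r in enumerate(pattern):
                    if r:
                        inf_pos[j] += n * w            # infected and test j +ve
                    else:
                        un_neg[j] += n * (1 - w)       # uninfected and test j -ve
        # M-step: re-estimate parameters from the expected counts
        n_inf = sum(inf_by_pop)
        n_un = sum(sum(t) for t in tables) - n_inf
        prev = [inf_by_pop[k] / sum(tables[k]) for k in range(len(tables))]
        se = [inf_pos[j] / n_inf for j in range(2)]
        sp = [un_neg[j] / n_un for j in range(2)]
    return prev, se, sp


# Hypothetical expected counts (1000 animals per population) from known values.
true_prev, true_se, true_sp = [0.1, 0.6], [0.9, 0.8], [0.95, 0.98]
tables = [
    [1000 * (a + b) for a, b in pattern_probs(p, true_se, true_sp)]
    for p in true_prev
]
prev, se, sp = hui_walter_em(tables)
print([round(v, 3) for v in prev + se + sp])
```

With two tests and two populations there are six parameters and six degrees of freedom, which is why the populations must differ in prevalence for the estimates to be identifiable.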

TAGS
• Open R (shortcut in root directory of stick)
• Open tags.R in a text editor or Word
• Select all and copy/paste into the R console
• Type TAGS() and press Enter to run
• Hui Walter example: 2 tests for TB
  - Test 1 = Mantoux
  - Test 2 = Tine test

• Follow the prompts to enter data:
  - Data set = new
  - Name = test
  - Number of tests = 2, Number of populations = 2
  - Reference population? = No (0)
  - Enter results for each population from table below
  - Best guesses: use defaults
  - Bootstrap CI = Yes (1000 iterations)

Data table: Test 1 by Test 2 results for Population 1 and Population 2; counts not preserved.

• Output:
  $Estimations
          pre1  pre2  Sp1  Sp2  Se1  Se2
  Est
  CIinf
  CIsup

Bayesian estimation
• What is Bayesian estimation?
  - Combines prior knowledge/belief (what you think you know) with data to give best estimate
  - Incorporates existing knowledge on parameters (Se, Sp, prevalence)
  - "Priors" entered as probability (usually Beta) distributions
  - Uses Monte Carlo simulation to solve
  - Outputs also as probability distributions
  - Can get very complex
• Assumptions
  - Independence of the tests
  - Appropriate prior distributions chosen; need information on prior probabilities
  - Some methods can adjust for correlated tests
  - Multiple tests in multiple populations

• Methods
  - EpiTools (only allows one population, so must have good information on one or more test characteristics)
  - WinBUGS models

Bayesian analysis: surra data

                   ELISA (Test 2)
CATT (Test 1)      +ve    -ve    Total
+ve                  0     39       39
-ve                  0    251      251
Total                0    290      290

Inputs for Bayesian analysis for revised sensitivity and specificity estimates

Prior distributions for Bayesian analysis
                         x     n    alpha   beta
Prev                                    1      1
Se_CATT (81%)                          82     20
Sp_CATT (99.4%)                       160      2
Se_ELISA_2 (75%)                       76     26
Sp_ELISA_2 (97.5%)                    118      4

EpiTools
• Run EpiTools > Estimating true prevalence > Bayesian estimation with two tests
• Enter parameters:
  - Data from 2x2 table: 0, 39, 0, 251
  - Prevalence = Beta(1,1) (uniform = don't know)
  - Test 1 (CATT): Se = Beta(82, 20), Sp = Beta(160, 2)
  - Test 2 (ELISA): Se = Beta(76, 26), Sp = Beta(118, 4)
  - Starting values: 0, 38, 0, 245
  - Other values as defaults and click submit
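The Beta priors above are consistent with a simple counting rule: x successes in n trials, added to a uniform Beta(1, 1), give Beta(x + 1, n - x + 1). The Python sketch below applies that rule. The x and n values are back-calculated from the Beta parameters and the quoted percentages and do not appear in the transcript, so treat them as assumptions; the function names are illustrative.

```python
def beta_from_counts(x, n):
    """Beta(alpha, beta) for x successes in n trials on top of a uniform Beta(1, 1)."""
    return x + 1, n - x + 1


def beta_mode(alpha, beta):
    """Most likely value of a Beta(alpha, beta) distribution (alpha, beta > 1)."""
    return (alpha - 1) / (alpha + beta - 2)


# x and n are back-calculated assumptions, chosen to match the slide's priors.
priors = {
    "Se_CATT": beta_from_counts(81, 100),     # 81%
    "Sp_CATT": beta_from_counts(159, 160),    # 99.4%
    "Se_ELISA": beta_from_counts(75, 100),    # 75%
    "Sp_ELISA": beta_from_counts(117, 120),   # 97.5%
}

for name, (alpha, beta) in priors.items():
    print(f"{name}: Beta({alpha}, {beta}), mode = {beta_mode(alpha, beta):.1%}")
```

Larger n gives a narrower prior, so the choice of n expresses how much confidence is placed in the earlier estimate of Se or Sp.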

Results table (values not preserved)
• Columns: Prevalence, Sensitivity-1, Specificity-1, Sensitivity-2, Specificity-2
• Rows: Minimum, lower percentile, Median, upper percentile, Maximum, Mean, SD
• Iterations: 20000