Sensitivity, Specificity and ROC Curve Analysis.

Slides:



Advertisements
Similar presentations
Lecture 3 Validity of screening and diagnostic tests
Advertisements

...visualizing classifier performance in R Tobias Sing, Ph.D. (joint work with Oliver Sander) Modeling & Simulation Novartis Pharma AG 3 rd BaselR meeting.
Evaluation of segmentation. Example Reference standard & segmentation.
Curva ROC figuras esquemáticas Curva ROC figuras esquemáticas Prof. Ivan Balducci FOSJC / Unesp.
Logistic Regression.
Receiver Operating Characteristic (ROC) Curves
Lecture 16: Logistic Regression: Goodness of Fit Information Criteria ROC analysis BMTRY 701 Biostatistical Methods II.
What Happens to the Performance of a Diagnostic Test when the Disease Prevalence and the Cut-Point Change? Pathological scores Healthy scores Healthy population.
GerstmanChapter 41 Epidemiology Kept Simple Chapter 4 Screening for Disease.
Anthropometry Technique of measuring people Measure Index Indicator Reference Information.
Performance measures Morten Nielsen, CBS, BioCentrum, DTU.
Introduction to Biomedical Statistics. Signal Detection Theory What do we actually “detect” when we say we’ve detected something?
Chapter 9.2 ROC Curves How does this relate to logistic regression?
Chapter 9.2 ROC Curves How does this relate to logistic regression?
Today Concepts underlying inferential statistics
Decision Tree Models in Data Mining
Thoughts on Biomarker Discovery and Validation Karla Ballman, Ph.D. Division of Biostatistics October 29, 2007.
What is Screening? Basic Public Health Concepts Sheila West, Ph.D. El Maghraby Professor of Ophthalmology Wilmer Eye Institute Johns Hopkins University.
Interpreting Diagnostic Tests
Measurement and Data Quality
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
David Yens, Ph.D. NYCOM PASW-SPSS STATISTICS David P. Yens, Ph.D. New York College of Osteopathic Medicine, NYIT l PRESENTATION.
BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.
SPH 247 Statistical Analysis of Laboratory Data May 19, 2015SPH 247 Statistical Analysis of Laboratory Data1.
What is Screening? Basic Health promotion Concepts Health promotion national conference 2010 Poster Presentation supervised by Dr Aidah Al Kaissi, RN,BSN,MD,PhD.
Stats Tutorial. Is My Coin Fair? Assume it is no different from others (null hypothesis) When will you no longer accept this assumption?
1 Lecture 2 Screening and diagnostic tests Normal and abnormal Validity: “gold” or criterion standard Sensitivity, specificity, predictive value Likelihood.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.
Basic statistics 11/09/13.
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 2: Diagnostic Classification.
1 Interpreting Diagnostic Tests Ian McDowell Department of Epidemiology & Community Medicine January 2012 Note to readers: you may find the additional.
Screening and Diagnostic Testing Sue Lindsay, Ph.D., MSW, MPH Division of Epidemiology and Biostatistics Institute for Public Health San Diego State University.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Evaluation of Diagnostic Tests
EVIDENCE ABOUT DIAGNOSTIC TESTS Min H. Huang, PT, PhD, NCS.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
1 Epidemiological Measures I Screening for Disease.
MEASURES OF TEST ACCURACY AND ASSOCIATIONS DR ODIFE, U.B SR, EDM DIVISION.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Evaluating Diagnostic Tests Payam Kabiri, MD. PhD. Clinical Epidemiologist Tehran University of Medical Sciences.
Appraising A Diagnostic Test
Chapter 10 Screening for Disease
1 Risk Assessment Tests Marina Kondratovich, Ph.D. OIVD/CDRH/FDA March 9, 2011 Molecular and Clinical Genetics Panel for Direct-to-Consumer (DTC) Genetic.
Evaluating Results of Learning Blaž Zupan
TESTING A TEST Ian McDowell Department of Epidemiology & Community Medicine January 2008.
Screening of diseases Dr Zhian S Ramzi Screening 1 Dr. Zhian S Ramzi.
1 Wrap up SCREENING TESTS. 2 Screening test The basic tool of a screening program easy to use, rapid and inexpensive. 1.2.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
Unit 15: Screening. Unit 15 Learning Objectives: 1.Understand the role of screening in the secondary prevention of disease. 2.Recognize the characteristics.
Screening.  “...the identification of unrecognized disease or defect by the application of tests, examinations or other procedures...”  “...sort out.
Sigmoidal Response (knnl558.sas). Programming Example: knnl565.sas Y = completion of a programming task (1 = yes, 0 = no) X 2 = amount of programming.
1 Medical Epidemiology Interpreting Medical Tests and Other Evidence.
10 May Understanding diagnostic tests Evan Sergeant AusVet Animal Health Services.
Evaluating Classification Performance
Laboratory Medicine: Basic QC Concepts M. Desmond Burke, MD.
ROC curve estimation. Index Introduction to ROC ROC curve Area under ROC curve Visualization using ROC curve.
Timothy Wiemken, PhD MPH Assistant Professor Division of Infectious Diseases Diagnostic Tests.
CHAPTER 3 Key Principles of Statistical Inference.
Sensitivity, Specificity, and Receiver- Operator Characteristic Curves 10/10/2013.
Screening Tests: A Review. Learning Objectives: 1.Understand the role of screening in the secondary prevention of disease. 2.Recognize the characteristics.
دکتر حمیدرضا صابری
Performance of a diagnostic test Tunisia, 31 Oct 2014
Class session 7 Screening, validity, reliability
بسم الله الرحمن الرحيم Clinical Epidemiology
What is Screening? Basic Public Health Concepts Sheila West, Ph.D.
How do we judge efficacy of a screening test?
What is Screening? Basic Public Health Concepts Sheila West, Ph.D.
ביצוע רגרסיה לוגיסטית. פרק ה-2
Presentation transcript:

Sensitivity, Specificity and ROC Curve Analysis

Criteria for Evaluating a Screening Test Validity : provide a good indication of who does and does not have disease -Sensitivity of the test -Specificity of the test Reliability : (precision): gives consistent results when given to same person under the same conditions Yield : Amount of disease detected in the population, relative to the effort -Prevalence of disease/predictive value

Validity of Screening Test (Accuracy) - Sensitivity: Is the test detecting true cases of disease? Ideal is 100%: 100% of cases are detected; =Pr(T+|D+) -Specificity: Is the test excluding those without disease? Ideal is 100%: 100% of non-cases are negative; =Pr(T-|D-) - See Gehlbach, Chp. 10

True Cases of Glaucoma YesNo IOP > 22:Yes50100 No (total) Sensitivity = 50% (50/100) False Negative=50% Specificity = 95% (1900/2000) False Positive=5% Example: Screening for Glaucoma using IOP

Consider: -The impact of high number of false positives: anxiety, cost of further testing -Importance of not missing a case: seriousness of disease, likelihood of re-screening Where do we set the cut-off for a screening test?

Yield from the Screening Test: Predictive Value Relationship between Sensitivity, Specificity, and Prevalence of Disease Prevalence is low, even a highly specific test will give large numbers of False Positives Predictive Value of a Positive Test (PPV): Likelihood that a person with a positive test has the disease Predictive Value of a Negative Test (NPV): Likelihood that a person with a negative test does not have the disease

True Cases of Glaucoma YesNo IOP > 22:Yes50100 No (total) Specificity = 95% (1900/2000) False Positive=5% Positive Predictive Value =33% (50/150) Screening for Glaucoma using IOP

How Good does a Screening Test have to be? IT DEPENDS - Seriousness of disease, consequences of high false positivity rate: - Rapid HIV test should have >90% sensitivity, 99.9% specificity -Screen for nearsighted children proposes 80% sensitivity, >95% specificity -Pre-natal genetic questionnaire could be 99% sensitive, 80% specific

Choosing a cut-point: receiver operating characteristic curves Situation where screening test yields results as a continuous value (e.g., intraocular pressure for glaucoma) Want to select a value above (or below) which to call “diseased” or “at risk” How do we select that value?

Non-diseased cases Diseased cases Test result value or subjective judgment of likelihood that case is diseased Threshold

12 Non-diseased cases Diseased cases Test result value or subjective judgment of likelihood that case is diseased More typically:

Threshold TP Fraction (sensitivity) FP Fraction (1-specificity) less aggressive mindset Non-diseased cases Diseased cases

Threshold moderate mindset Non-diseased cases Diseased cases TP Fraction (sensitivity) FP Fraction (1-specificity)

Threshold more aggressive mindset Non-diseased cases Diseased cases TP Fraction (sensitivity) FP Fraction (1-specificity)

Threshold Non-diseased cases Diseased cases Entire ROC curve TP Fraction (sensitivity) FP Fraction (1-specificity)

Entire ROC curve Reader Skill and/or Level of Technology chance line TP Fraction (sensitivity) FP Fraction (1-specificity) Highly discriminate (good) Somewhat discriminate (not as good) Non-informative (no better than chance) Use area under to curve (AUC) to judge discriminating ability. Gehlbach: want AUC>80%

Luke Neff: Refractory Burn Shock Data Logistic Regression and ROC Curve Analysis Response Profile Ordered Value PET Total Frequency Testing Global Null Hypothesis: BETA=0 TestChi-SquareDFPr > ChiSq Likelihood Ratio <.0001 Score <.0001 Wald

Luke Neff: Refractory Burn Shock Data Logistic Regression and ROC Curve Analysis Analysis of Maximum Likelihood Estimates ParameterDFEstimate Standard Error Wald Chi-Square Pr > ChiSq Intercept Admission Lactate Odds Ratio Estimates Effect Point Estimate 95% Wald Confidence Limits Admission Lactate

Luke Neff: Refractory Burn Shock Data Logistic Regression and ROC Curve Analysis Area Standard Error % Wald Confidence Limits

Pred ProbTrue PosTrue NegFalse Pos False NegSe1 - Sp Corresponds to lactate value of about 3.0 Point that Maximizes sum of sensitivity and specificity.