1 Lecture 2 Screening and diagnostic tests Normal and abnormal Validity: “gold” or criterion standard Sensitivity, specificity, predictive value Likelihood.

Slides:



Advertisements
Similar presentations
Chapter 8 Flashcards.
Advertisements

Research Curriculum Session II –Study Subjects, Variables and Outcome Measures Jim Quinn MD MS Research Director, Division of Emergency Medicine Stanford.
Lecture 3 Validity of screening and diagnostic tests
ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
Epidemiological study of mild cognitive impairment and pilot evaluation of methods of early dementia detection in Chinese community Yueqin Huang MD MPH.
Divisional Meeting 15 th January 2009 Streptococcal Pharyngitis: A Systematic Review of the Predictive Value of Signs and Symptoms and the External Validation.
Assessing risk of functional decline in emergency departments MS Bakken, MD PhD student X EAMA Advanced Postgraduate Course in Geriatrics Martigny, Switzerland,
12 June 2004Clinical algorithms in public health1 Seminar on “Intelligent data analysis and data mining – Application in medicine” Research on poisonings.
Critically Evaluating the Evidence: diagnosis, prognosis, and screening Elizabeth Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management.
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
Sensitivity, Specificity and ROC Curve Analysis.
PEBB Disease Burden Report PEBB Board of Directors August 21, 2007 Bdattach.10.
Concept of Measurement
Journal Club Alcohol and Health: Current Evidence July–August 2005.
Journal Club Alcohol and Health: Current Evidence September–October 2004.
By Dr. Ahmed Mostafa Assist. Prof. of anesthesia & I.C.U. Evidence-based medicine.
Mary Ganguli’s Slides March 13 th Meeting. Mild Cognitive Impairment A View from the Trenches.
Screening By building screening for symptoms of VCI into regular workflows or practice, health care providers are participating in Taking Action to address.
Chapter 7 Correlational Research Gay, Mills, and Airasian
EVIDENCE BASED MEDICINE
Cohort Studies Hanna E. Bloomfield, MD, MPH Professor of Medicine Associate Chief of Staff, Research Minneapolis VA Medical Center.
Thoughts on Biomarker Discovery and Validation Karla Ballman, Ph.D. Division of Biostatistics October 29, 2007.
Screening and Early Detection Epidemiological Basis for Disease Control – Fall 2001 Joel L. Weissfeld, M.D. M.P.H.
Screening Sherine Shawky, MD, Dr.PH Assistant Professor Public Health King Abdulaziz University College of Medicine
Measurement and Data Quality
報 告 者 王瓊琦. postpartum depression : identification of women at risk.
Statistics in Screening/Diagnosis
BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.
The Nature of Disease.
Multiple Choice Questions for discussion
1 Lecture 2: Types of measurement Purposes of measurement Types and sources of data Reliability and validity Levels of measurement Types of scale.
Instrumentation.
Diagnostic Cases. Goals & Objectives Highlight Bayesian and Boolean processes used in classic diagnosis Demonstrate use/misuse of tests for screening.
CHP400: Community Health Program- lI Research Methodology STUDY DESIGNS Observational / Analytical Studies Case Control Studies Present: Disease Past:
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Teaching Registrars Research Methods Variable definition and quality control of measurements Prof. Rodney Ehrlich.
Are hospital readmissions in the elderly preventable? Antonio Sarría-Santamera MD PhD Institute of Health Carlos III University of Alcalá DUKE-NUS HSSR.
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Diagnostic accuracy of the STRATIFY clinical.
Vanderbilt Sports Medicine How to practice and teach EBM Chapter 3 May 3, 2006.
Lecture 6: Reliability and validity of scales (cont) 1. In relation to scales, define the following terms: - Content validity - Criterion validity (concurrent.
Dr K N Prasad Community Medicine
Evaluation of Diagnostic Tests
EVIDENCE ABOUT DIAGNOSTIC TESTS Min H. Huang, PT, PhD, NCS.
Risk Assessment Farrokh Alemi, Ph.D.. Session Objectives 1.Discuss the role of risk assessment in the TQM process. 2.Describe the five severity indices.
Fear of Falling Among Seniors: Needs Assessment and Intervention Strategies Susan L. Murphy ScD, OTR World Federation of Occupational Therapy Conference.
Do Instrumental Activities of Daily Living Predict Dementia at 1- and 2- Year Follow-Up? Findings from the Development of Screening Guidelines and Diagnostic.
1 Risk Assessment Tests Marina Kondratovich, Ph.D. OIVD/CDRH/FDA March 9, 2011 Molecular and Clinical Genetics Panel for Direct-to-Consumer (DTC) Genetic.
Lecture 9: Analysis of intervention studies Randomized trial - categorical outcome Measures of risk: –incidence rate of an adverse event (death, etc) It.
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
Prediction statistics Prediction generally True and false, positives and negatives Quality of a prediction Usefulness of a prediction Prediction goes Bayesian.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
Unit 15: Screening. Unit 15 Learning Objectives: 1.Understand the role of screening in the secondary prevention of disease. 2.Recognize the characteristics.
Screening.  “...the identification of unrecognized disease or defect by the application of tests, examinations or other procedures...”  “...sort out.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
EVALUATING u After retrieving the literature, you have to evaluate or critically appraise the evidence for its validity and applicability to your patient.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Long-Term Outcomes after Acute Stroke Treatment Larry B. Goldstein, M.D. Professor of Medicine (Neurology) Center for Cerebrovascular Disease Center for.
بسم الله الرحمن الرحيم Community Medicine Lec -11-
© 2010 Jones and Bartlett Publishers, LLC. Chapter 12 Clinical Epidemiology.
Screening Tests: A Review. Learning Objectives: 1.Understand the role of screening in the secondary prevention of disease. 2.Recognize the characteristics.
Understanding Populations & Samples
Diagnostic Test Studies
Understanding Results
Roland C. Merchant, MD, MPH, ScD
What is Screening? Basic Public Health Concepts Sheila West, Ph.D.
What is Screening? Basic Public Health Concepts Sheila West, Ph.D.
Lecture 4 Study design and bias in screening and diagnostic tests
Evidence Based Diagnosis
Presentation transcript:

1 Lecture 2 Screening and diagnostic tests Normal and abnormal Validity: “gold” or criterion standard Sensitivity, specificity, predictive value Likelihood ratio ROC curves Bias: spectrum, verification, information

2 Clinical/public health applications screening: for asymptomatic disease (e.g., Pap test, mammography) case-finding: testing of patients for diseases unrelated to their complaint diagnostic: to help make diagnosis in symptomatic disease or to follow-up on screening test

3 Evaluation of screening and diagnostic tests Performance characteristics –test alone Effectiveness (on outcomes of disease): –test + intervention

4 Criteria for test selection Reproducibility Validity Feasibility Simplicity Cost Acceptability

5 Sources of variation: Biological or true variation between individuals within individuals (e.g., diurnal variation in BP) –“controlled” by standardizing time of measurement

6 Sources of variation: Measurement error random error vs systematic error (bias) method (measuring instrument) observer

7

8 Quality of measurements Validity (accuracy) –Does it measure what it is intended to? –Lack of bias Reproducibility (reliability, precision, consistency) of measurements

9 Examples of types of reproducibility Between and within observer (inter- and intra-observer variation) –May be random or systematic Regression toward the mean –Systematic error when subjects have extreme values (more likely to be in error than typical values)

10 Validity (accuracy) Criterion validity –concurrent –predictive Face validity, content validity: judgement of the appropriateness of content of measurement Construct validity: validity of underlying entity or theoretical construct

11 Normal vs abnormal Statistical definition –“Gaussian” or “normal” distribution Clinical definition –using criterion

12

13

14

15

16 Selection of criterion Concurrent –salivary screening test for HIV –history of cough more than 2 weeks (for TB) Predictive –APACHE (acute physiology and chronic disease evaluation) instrument for ICU patients –blood lipid level –maternal height

17

18 Sensitivity and specificity Assess correct classification of: People with the disease (sensitivity) People without the disease (specificity)

19 Predictive value More relevant to clinicians and patients Affected by prevalence

20 Choice of cut-point If higher score increases probability of disease Lower cut-point: –increases sensitivity, reduces specificity Higher cut-point: –reduces sensitivity, increases specificity

21 Considerations in selection of cut-point Implications of false positive results burden on follow-up services labelling effect Implications of false negative results Failure to intervene

22 Likelihood ratio Likelihood ratio (LR) = sensitivity 1-specificity Used to compute post-test odds of disease from pre-test odds: post-test odds = pre-test odds x LR pre-test odds derived from prevalence post-test odds can be converted to predictive value of positive test

23 Example of LR prevalence of disease in a population is 25% sensitivity is 80% specificity is 90%, pre-test odds = 0.25 = 1/ likelihood ratio = 0.80 =

24 Example of LR If prevalence of disease in a population is 25% pre-test odds = 0.25 = 1/ post-test odds = 1/3 x 8 = 8/3 predictive value of positive result = 8/3+8 = 8/11 = 73%

25 Receiver operating characteristic (ROC) curve Evaluates test over range of cut-points Plot of sensitivity against 1-specificity Area under curve (AUC) summarizes performance: –AUC of 0.5 = no better than chance

26

27 Spectrum bias Study population should be representative of population in which test will be used Is range of subjects tested adequate? –In population with low risk of outcome, sensitivity will be lower, specificity higher –In population with high risk of outcome, sensitivity will be higher, specificity lower Comorbidity may affect sensitivity and specificity

28 Verification bias results of test affect intensity of subsequent investigation increasing probability of detection of outcome in those with positive test result

29 Information bias Diagnosis is not blind to test result Improves test performance

30 Example: Screening seniors in the emergency department (ED) for risk of function decline High risk group Many not adequately evaluated or referred for appropriate services Development and validation of a brief screening tool to identify those at increased risk of functional decline and other adverse outcomes

31 Two multi-site studies in Montreal EDs Study 1: development of ISAR –Prospective observational cohort study –JAGS (1999) 47: Study 2: evaluation of 2-step intervention –randomized controlled trial –JAGS (2001) 49:

32 Common features of 2 studies 4 Montreal hospitals (2 participated in both studies) Patients aged 65+, community dwelling, English or French-speaking Exclusions: –cognitively impaired or severe illness with no proxy informant – language barrier (no English or French)

33 Differences between 2 studies: Study design Study 1 –Observational study –Follow-up at 3 and 6 months after ED visit Study 2 –Randomized controlled trial: 2-step intervention vs usual care –Randomization by day of visit –Follow-up at 1 and 4 months after ED visit

34 RESULTS: ISAR development Adverse health outcome defined as any of following during 6 months after ED visit >10% ADL decline Death Institutionalization

35 Scale development Selection of items that predicted all adverse health events Multiple logistic regression - “best subsets” analysis Review of candidate scales with clinicians to select clinically relevant scale

36 Identification of Seniors At Risk (ISAR) 1. Before the illness or injury that brought you to the Emergency, did you need someone to help you on a regular basis? (yes) 2. Since the illness or injury that brought you to the Emergency, have you needed more help than usual to take care of yourself? (yes) 3. Have you been hospitalized for one or more nights during the past 6 months (excluding a stay in the Emergency Department)? (yes) 4. In general, do you see well? (no) 5. In general, do you have serious problems with your memory? (yes) 6. Do you take more than three different medications every day? (yes) Scoring: (positive score shown in parentheses)

37

38 Other Outcomes Related to ISAR Source: Dendukuri et al, JAGS, in press Does ISAR score identify patients with current functional problems? –Self-reported premorbid function (OARS) –Function at home visit assessed by nurse 1-2 weeks after ED visit (SMAF)

39

40 Other Outcomes Related to ISAR Does ISAR predict adverse outcomes (other than functional decline) during the subsequent 5 or 6 months? –High hospital utilization (11+ days/5 months) –Frequent ED visits –Frequent community health center visits –Increase in depressive symptoms

41

42 Summary of data on performance Very good detection of patients with current functional problems and depression (AUC values ) Moderate ability to predict future adverse health events (functional decline) and health center utilization (AUC values around 0.7) Fair ability to predict future hospital and ED utilization (AUC values )

43 Comparison with other screening tools for patients admitted to hospital Source: McCusker et al, J Gerontol 2002; 57A: M Systematic literature review Predictors of functional decline (including nursing home admission) among hospitalized seniors Investigated individual risk factors and predictive indices

44 Predictive indices Inouye (1993): FD and NH at 3 mo –4 factors: decubitus ulcer, cognitive impairment, premorbid functional impairment, low social activity Mateev(1998): D/NH at 3 mo. –clinical targeting criteria

45 Predictive indices (cont) McCusker (1999): FD/NH/ D at 6 mo. –Identification of Seniors At Risk (ISAR): 6- item self-report questionnaire Narain (1988): NH at 6 mo –hand-developed algorithm based on residence, mental status, diagnosis

46 Predictive indices (cont) Rubenstein (1984): FD and NH at 12 mo –expected discharge location and diagnosis Sager (1996): FD at 3mo –Hospital Admission Risk Profile (HARP) (age, MMSE and IADL) Zureik (1997): NH at discharge –6-item index

47 Performance of 7 predictive indices for functional decline

48 Performance of predictive indices Moderate performance (AUC )