Judgement and Decision Making in Information Systems Diagnostic Modeling: Bayes’ Theorem, Influence Diagrams and Belief Networks Yuval Shahar, M.D., Ph.D.

Slides:



Advertisements
Similar presentations
Lecture 3 Validity of screening and diagnostic tests
Advertisements

Diagnostic Metrics Week 2 Video 3. Different Methods, Different Measures  Today we’ll continue our focus on classifiers  Later this week we’ll discuss.
Chapter 4 Pattern Recognition Concepts: Introduction & ROC Analysis.
Rulebase Expert System and Uncertainty. Rule-based ES Rules as a knowledge representation technique Type of rules :- relation, recommendation, directive,
Learning Algorithm Evaluation
Evaluation of segmentation. Example Reference standard & segmentation.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.
Module 6 “Normal Values”: How are Normal Reference Ranges Established?
Curva ROC figuras esquemáticas Curva ROC figuras esquemáticas Prof. Ivan Balducci FOSJC / Unesp.
Diagnostic Tests Patrick S. Romano, MD, MPH Professor of Medicine and Pediatrics Patrick S. Romano, MD, MPH Professor of Medicine and Pediatrics.
Receiver Operating Characteristic (ROC) Curves
Uncertainty Everyday reasoning and decision making is based on uncertain evidence and inferences. Classical logic only allows conclusions to be strictly.
Azita Kheiltash Social Medicine Specialist Tehran University of Medical Sciences Diagnostic Tests Evaluation.
Diagnosing – Critical Activity HINF Medical Methodologies Session 7.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Evaluation of Screening and Diagnostic Tests.
Assessing and Comparing Classification Algorithms Introduction Resampling and Cross Validation Measuring Error Interval Estimation and Hypothesis Testing.
Design of Health Technologies lecture 19 John Canny 11/14/05.
1 Bayesian Reasoning Chapter 13 CMSC 471 Adapted from slides by Tim Finin and Marie desJardins.
Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology.
1 The Expected Performance Curve Samy Bengio, Johnny Mariéthoz, Mikaela Keller MI – 25. oktober 2007 Kresten Toftgaard Andersen.
Lucila Ohno-Machado An introduction to calibration and discrimination methods HST951 Medical Decision Support Harvard Medical School Massachusetts Institute.
Judgment and Decision Making in Information Systems Probability, Utility, and Game Theory Yuval Shahar, M.D., Ph.D.
Judgment and Decision Making in Information Systems Computing with Influence Diagrams and the PathFinder Project Yuval Shahar, M.D., Ph.D.
Critiquing for Evidence-based Practice: Diagnostic and Screening Tests M8120 Columbia University Fall 2001 Suzanne Bakken, RN, DNSc.
Medical Decision-Support Systems Probabilistic Reasoning in Diagnostic Systems Yuval Shahar, M.D., Ph.D.
Screening Test for Occult Cancer 100 patients with occult cancer: 95 have "x" in their blood 100 patients without occult cancer: 95 do not have "x" in.
Statistics in Screening/Diagnosis
BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.
Judgment and Decision Making in Information Systems Introduction: Decision Analysis and Human Judgment Yuval Shahar, M.D., Ph.D.
Reasoning with Bayesian Networks. Overview Bayesian Belief Networks (BBNs) can reason with networks of propositions and associated probabilities Useful.
Multiple Choice Questions for discussion
Medical decision making. 2 Predictive values 57-years old, Weight loss, Numbness, Mild fewer What is the probability of low back cancer? Base on demographic.
Non-Traditional Metrics Evaluation measures from the Evaluation measures from the medical diagnostic community medical diagnostic community Constructing.
Midterm Review Rao Vemuri 16 Oct Posing a Machine Learning Problem Experience Table – Each row is an instance – Each column is an attribute/feature.
Soft Computing Lecture 17 Introduction to probabilistic reasoning. Bayesian nets. Markov models.
Diagnostic Testing Ethan Cowan, MD, MS Department of Emergency Medicine Jacobi Medical Center Department of Epidemiology and Population Health Albert Einstein.
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Reasoning with Bayesian Belief Networks. Overview Bayesian Belief Networks (BBNs) can reason with networks of propositions and associated probabilities.
CpSc 810: Machine Learning Evaluation of Classifier.
MEASURES OF TEST ACCURACY AND ASSOCIATIONS DR ODIFE, U.B SR, EDM DIVISION.
Appraising A Diagnostic Test
Likelihood 2005/5/22. Likelihood  probability I am likelihood I am probability.
Evidence-Based Medicine Diagnosis Component 2 / Unit 5 1 Health IT Workforce Curriculum Version 1.0 /Fall 2010.
Evaluating Results of Learning Blaž Zupan
Uncertainty Management in Rule-based Expert Systems
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Lecture notes 9 Bayesian Belief Networks.
Prediction statistics Prediction generally True and false, positives and negatives Quality of a prediction Usefulness of a prediction Prediction goes Bayesian.
Screening of diseases Dr Zhian S Ramzi Screening 1 Dr. Zhian S Ramzi.
1 Wrap up SCREENING TESTS. 2 Screening test The basic tool of a screening program easy to use, rapid and inexpensive. 1.2.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
Decision Analytic Approaches for Evidence-Based Practice M8120 Fall 2001 Suzanne Bakken, RN, DNSc, FAAN School of Nursing & Department of Medical Informatics.
Clinical Decision Support 1 Historical Perspectives.
Diagnostic Test Characteristics: What does this result mean
Evaluating Classification Performance
DECISION MAKING TOOLS 1. Elements of Decision Problems 2.
Laboratory Medicine: Basic QC Concepts M. Desmond Burke, MD.
Evaluation of Diagnostic Tests & ROC Curve Analysis PhD Özgür Tosun.
Introduction on Graphic Models
ROC curve estimation. Index Introduction to ROC ROC curve Area under ROC curve Visualization using ROC curve.
Timothy Wiemken, PhD MPH Assistant Professor Division of Infectious Diseases Diagnostic Tests.
PTP 560 Research Methods Week 12 Thomas Ruediger, PT.
Diagnostic Likelihood Ratio Presented by Juan Wang.
Sensitivity, Specificity, and Receiver- Operator Characteristic Curves 10/10/2013.
Uses of Diagnostic Tests Screen (mammography for breast cancer) Diagnose (electrocardiogram for acute myocardial infarction) Grade (stage of cancer) Monitor.
Bayes’ Theorem Suppose we have estimated prior probabilities for events we are concerned with, and then obtain new information. We would like to a sound.
Probability and Statistics
Evaluating Results of Learning
Component 11: Configuring EHRs
Data Mining Classification: Alternative Techniques
Presentation transcript:

Judgement and Decision Making in Information Systems Diagnostic Modeling: Bayes’ Theorem, Influence Diagrams and Belief Networks Yuval Shahar, M.D., Ph.D.

Reasoning Under Uncertainty Example: Medical Diagnosis Uncertainty is inherent to medical reasoning – relation of diseases to clinical and laboratory findings is probabilistic –Patient data itself is often uncertain with respect to value and time –Patient preferences regarding outcomes vary –Cost of interventions and therapy can change Principles of diagnosis modeling and computation are identical in engineering, finance, scientific evidence

Test Characteristics Disease Test result Disease present Disease absent Total Positive True positive (TP) False positive (FP) TP+FP Negative False negative (FN) True negative (TN) FN+TN TP+FNFP+TN

Test Performance Measures The gold standard test: the procedure that defines presence or absence of a disease (often, very costly) The index test: The test whose performance is examined True positive rate (TPR) = Sensitivity: –P(Test is positive|patient has disease) = P(T+|D+) –Ratio of number of diseased patients with positive tests to total number of patient: TP/(TP+FN) True negative rate (TNR) = Specificity –P(Test is negative|patient has no disease) = P(T-|D-) –Ratio of number of nondiseased patients with negative tests to total number of patients: TN/(TN+FP)

Test Predictive Values Positive predictive value (PV+) = P(D|T+) = TP/(TP+FP) Negative predictive value (PV-) = P(D-|T-) = TN/(TN+FN)

Lab Tests: What is “Abnormal”?

The Cut-off Value Trade off Sensitivity and specificity depend on the cut off value between what we define as normal and abnormal Assume high test values are abnormal; then, moving the cut-off value to a higher one increases FN results and decreases FP results (i.e. more specific) and vice versa There is always a trade off in setting the cut-off point

Receiver Operating Characteristic (ROC) Curves: Examples

Receiver Operating Characteristic (ROC) Curves: Interpretation ROC curves summarize the trade-off between the TPR (sensitivity) and the false positive rate (FPR) (1-specificity) for a particular test, as we vary the cut-off threshold The greater the area under the ROC curve, the better (more sensitive, more specific) the index test we are considering

Application of Bayes’ Theorem to Diagnosis

Odds-Likelihood (Odds Ratio) Form of Bayes’ Theorem Odds = P(A)/(1-P(A)); P = Odds/(1+Odds) Post-test odds = pretest odds * likelihood ratio

Application of Bayes’ Theorem Needs reliable pre-test probabilities Needs reliable post-test likelihood ratios Assumes one disease only (mutual exclusivity of diseases) Can be used in sequence for several tests, but only if they are conditionally independent given the disease; then we use the post-test probability of T i as the pre- test probability for T i+1 (AKA Simple, or Naïve, Bayes)

Relation of Pre-Test and Post-Test Probabilities

Example: Computing Predictive Values Assume P(Downs Syndrom): – (A) 0.1% (age 30) – (B) 2% (age 45) Assume for amniocentesis: Sensitivity is 99%, Specificity is 99%, for Downs Syndrome For both cases, A and B: –PV+ = P(DS|Amnio+) = ?? –PV- = P(DS-|Amnio-) = ??

Predictive Values: Down Syndrom

Bayesian Diagnostic System Example: de Dombal’s Abdominal-Pain System (1972) Domain: Acute abdominal pain (7 possible diagnoses) Input: Signs and symptoms of patient Output: Probability distribution of diagnoses Method: Naïve Bayesian classification Evaluation: an eight-center study involving 250 physicians and 16,737 patients Results: –Diagnostic accuracy rose from 46 to 65% –The negative laparotomy rate fell by almost half –Perforation rate among patients with appendicitis fell by half –Mortality rate fell by 22% Results using survey data consistently better than the clinicians’ opinions and even the results using human probability estimates!

Influence Diagrams: An Alternative, Powerful Tool for Modeling Decisions A graphical notation for modeling situations involving multiple decisions, probabilities, and utilities Computationally: equivalent to decision trees Has several distinct advantages and disadvantages relative to decision trees

Influence Diagrams: Node Conventions Chance node Decision node Utility node Deterministic node

Link Semantics in Influence Diagrams Dependence link (possible probabilistic relationship) Information link Influence link “No-forgetting” link

Decision Trees: an HIV Example Decision node Chance node

Influence Diagrams: An HIV Example

The Structure of Influence Diagram Links

Belief Networks (Bayesian/Causal Probabilistic/Probabilistic Networks, etc) Disease Fever Sinusitis Runny nose Headache Influence diagrams without decision and utility nodes Gender

Link Semantics in Belief Networks Dependence Independence Conditional independence of B and C, given A B C A

Assessment Versus Observation Orders Usually it is most convenient to consider relationship between diagnoses and tests in assessment order: P(T+|D+) –This is also the easier order in which experts assess the probabilities or in which we can learn them from data However, in real life we need to compute the probability of diseases given observed findings, that is, in observation order: P(D+|T+) –That is the fashion in which diagnostic problems are typically presented, although less easy to compute Thus, we often need to reverse the dependence arc to solve the influence diagram or belief network (or to draw the corresponding decision tree)

Assessment Order Representation 0.8 T+ 0.2 T- 0.2 T+ 0.8 T- 0.4 D+ 0.6 D D T P(D  & T  )

Observation order Representation: Reversing the Arcs D D D D T T D T P(D  & T  )

Advantages of Influence Diagrams and Belief Networks: Modeling Implications Excellent modeling tool that supports acquisition from domain experts –Intuitive semantics (e.g., information and influence links) –Explicit representation of dependencies –Representation in assessment (not observation) order –very concise representation of large decision models

Advantages of Influence Diagrams and Belief Networks: Computational Implications “Anytime” algorithms available (using probability theory) to compute the distribution of values at any node given the values of any subset of the nodes (e.g., at any stage of information gathering) Explicit support for value of information computations

Disadvantages of Influence Diagrams and Belief Networks for Modeling The order of decisions (timing) might be obscured The precise relationship between decisions and available information is hidden within the nodes Highly asymmetric problems might be easier to represent as decision trees –Influence diagrams require using a lot of 0/1 probabilities to represent asymmetry (e.g., if the test is not done, the result is sure to be unknown)

Problems in Using Influence Diagrams and Belief Networks for Computations Explicit representation of dependencies often requires acquisition of joint probability distributions (P(A|B,C)) Computation is in general intractable (NP hard), making even moderate-sized problems hard to solve without specialized algorithms Solution of even a relatively simple influence diagram requires the use of a computer and specialized software

Examples of Successful Belief- Network Applications In clinical medicine: –Pathological diagnosis at the level of a subspecialized medical expert (Pathfinder) –Endocrinological diagnosis (NESTOR) In bioinformatics: –Recognition of meaningful sites and features in DNA sequences –Educated guess of tertiary structure of proteins