Evaluating Diagnostic Accuracy of Prostate Cancer Using Bayesian Analysis Part of an Undergraduate Research course Chantal D. Larose.

Slides:



Advertisements
Similar presentations
Bayes rule, priors and maximum a posteriori
Advertisements

THE CENTRAL LIMIT THEOREM
Estimation of Means and Proportions
Hypothesis testing Another judgment method of sampling data.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
Sampling: Final and Initial Sample Size Determination
Receiver Operating Characteristic (ROC) Curves
Departments of Medicine and Biostatistics
1 Using Biostatistics to Evaluate Vaccines and Medical Tests Holly Janes Fred Hutchinson Cancer Research Center.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Evaluation of Screening and Diagnostic Tests.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Point and Confidence Interval Estimation of a Population Proportion, p
Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology.
REGRESSION AND CORRELATION
Inferences About Process Quality
Chapter 14 Inferential Data Analysis
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
1 Bayesian methods for parameter estimation and data assimilation with crop models Part 2: Likelihood function and prior distribution David Makowski and.
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
1 When the population (seizure, consignment) is too large to be analyzed in its entirety: – because of limitations in time and/or resources (personnel,
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
Statistical Decision Theory
Model Inference and Averaging
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
PARAMETRIC STATISTICAL INFERENCE
Bayesian inference review Objective –estimate unknown parameter  based on observations y. Result is given by probability distribution. Bayesian inference.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Bayesian Analysis and Applications of A Cure Rate Model.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Day 3: Sampling Distributions. CCSS.Math.Content.HSS-IC.A.1 Understand statistics as a process for making inferences about population parameters based.
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
Confidence intervals and hypothesis testing Petter Mostad
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
- 1 - Bayesian inference of binomial problem Estimating a probability from binomial data –Objective is to estimate unknown proportion (or probability of.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 07: BAYESIAN ESTIMATION (Cont.) Objectives:
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Ch15: Decision Theory & Bayesian Inference 15.1: INTRO: We are back to some theoretical statistics: 1.Decision Theory –Make decisions in the presence of.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 91 Lecture 9 Bayesian approaches for quantitative responses 9.1 Proof-of-concept studies outside oncology.
Objective Evaluation of Intelligent Medical Systems using a Bayesian Approach to Analysis of ROC Curves Julian Tilbury Peter Van Eetvelt John Curnow Emmanuel.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Sampling Distributions Statistics Introduction Let’s assume that the IQ in the population has a mean (  ) of 100 and a standard deviation (  )
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Naïve Bayes Classifier April 25 th, Classification Methods (1) Manual classification Used by Yahoo!, Looksmart, about.com, ODP Very accurate when.
Naive Bayes Classifier. REVIEW: Bayesian Methods Our focus this lecture: – Learning and classification methods based on probability theory. Bayes theorem.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Bayesian Estimation and Confidence Intervals Lecture XXII.
The simple linear regression model and parameter estimation
Bayes for Beginners Stephanie Azzopardi & Hrvoje Stojic
CONCEPTS OF HYPOTHESIS TESTING
Inference Concerning a Proportion
Continuous Random Variable
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Statistical NLP: Lecture 4
LECTURE 09: BAYESIAN LEARNING
LECTURE 07: BAYESIAN ESTIMATION
CS 594: Empirical Methods in HCC Introduction to Bayesian Analysis
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

Evaluating Diagnostic Accuracy of Prostate Cancer Using Bayesian Analysis Part of an Undergraduate Research course Chantal D. Larose

Overview Introduction Three Ingredients to the Analysis ROC Curves Bayesian Analysis Results Discussion

Introduction Prostate cancer is the most common non-skin cancer in America ▫The risk of being diagnosed with prostate cancer increase with age. While 1 in 10,000 men under 40 years of age are diagnosed, the rate increases to 1 in 15 in men over 60. Normal ProstateProstate Cancer

Introduction America’s population is aging ▫In 2000, persons over the age of 65 made up 12.4% of the country’s population. The proportion is expected to increase. That means an increasing proportion of the population is at risk for prostate cancer. Accurate testing could help thousands of patients.

Three Ingredients to the Analysis Test - A simple, non-invasive procedure Blood test to measure Prostate-Specific Antigen (PSA) levels. Gold standard - Can be a complex, expensive, invasive Determines the presence or absence of prostate cancer. Our gold standard is a biopsy Covariate - Additional information May help us increase the accuracy of our prediction. Our covariate is patient age.

There are two main questions: 1.How good is the PSA test alone at predicting the presence of prostate cancer? 2.How good is the PSA test at predicting prostate cancer when combined with information about a patient’s age.

ROC Curves Receiver Operating Characteristic (ROC) curves plots ‘True Positive Rate’ versus ‘False Positive Rate’ ▫True Positive Rate: probability that Positive test result is correct ▫False Positive Rate: probability that Positive test result is incorrect

ROC Curves The area under the ROC curve (AUC) serves as a measure of overall accuracy. If the AUC equals… ▫1, the test is perfect every time ▫0.5, it is as good as a coin flip ▫0, the diagnosis is the opposite of the test result. We expect values between 0.5 and 1. ROC curve and its AUC

Bayesian Analysis: Three Key Parts The Prior Distribution ▫Knowledge about the target parameter before looking at the data. ▫One of two general types: Informative prior: Holds information we know or suspect to be true, based on previous experiments or expert knowledge. Noninformative prior: Holds very little information. Best when we do not have previous experience or expert knowledge.

Bayesian Analysis: Three Key Parts The Data ▫The data used in Bayesian analysis is represented with a likelihood. ▫The likelihood is combined with the prior distribution.

Bayesian Analysis: Three Key Parts The Posterior Distribution ▫After combining the prior information with the data, represented by the likelihood, the result is the posterior distribution. ▫The posterior represents a natural updating of the prior knowledge based on the information from the data.

Bayesian vs. Frequentist Analysis Bayesian analysis allows us to update our initial distribution assumption to account for the observed data. The analysis also provides us the flexibility of directly analyzing the entire posterior distribution of the target parameter. ▫Thus, we can look at the mean, median, and other statistics that are useful to the questions we want to answer.

Bayesian vs. Frequentist Analysis Frequentist ApproachBayesian Approach considers population parameters as fixed, unknown constants. It is assumed that all the randomness lies in the data. takes the opposite view. The parameters are considered random variables which have their own distribution of possible values. The data is the known information. All resulting error comes from the distribution parameters. One key difference between Frequentist and Bayesian methods is how population parameters are treated.

Bayesian vs. Frequentist Analysis When is Bayesian analysis more appropriate? ▫When a parameter of a distribution is itself a random variable, or when expert knowledge is available ▫Works well with small data sets, where maximum likelihood methods are not appropriate

Bayesian vs. Frequentist Analysis AdvantagesDisadvantages Likely to have an idea about the prior distribution. If expert knowledge is not available, we may use a non- informative prior. Flexibility in choosing prior distribution Works well with small data sets, where maximum likelihood methods are not appropriate The Two Bayesian Problem: different priors produce different posteriors. Addressed by noninformative prior.

Forming an ROC Curve A parametric approach, using binormal distribution T he curve is a function of parameters a and b. ▫The values of a and b are also functions of two parameters, Beta and Variance. ▫These values are determined by a prior assumption and information from the data. The trickle-down effect

Forming an ROC Curve In other words, there is a trickle-down effect from the data and first assumptions to the ROC curve. To calculate this trickle-down effect, we use Bayesian statistical analysis. The trickle-down effect

Results Our analysis starts with two noninformative priors ▫The majority of the information in the posterior will come from the data. The data is organized into three groups: ▫All patients ▫Younger patients only ▫Older patients only

Results Each group undergoes two analyses: ▫one uses only test data to predict the diagnosis, ▫one uses test and age data combined. Each analysis produces an ROC curve. All together, six curves are calculated.

Discussion Each ROC curve had a section which fell below the 50% accuracy line. There was very little difference between pairs of ROC curves for a single age group Differences in curves between older and younger patients were more pronounced. ▫This makes sense. Since younger patients have naturally low PSA levels, it is easier to detect high levels, and elevated levels are more likely due to cancer.

Discussion Our progress so far has brought more questions. Among these include: How would we use the Bayesian technique with an incomplete dataset (i.e. missing information for certain patients)? Why does each ROC curve dip below the 50% accuracy line? What effects do other covariates, such as ethnic background, have on the ROC curve?