Decision theory and Bayesian statistics. Tests and problem solving Petter Mostad 2005.11.21.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Hypothesis Testing Steps in Hypothesis Testing:
Chapter 18 Statistical Decision Theory Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th.
Chapter 21 Statistical Decision Theory
Managerial Decision Modeling with Spreadsheets
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Decision theory and Bayesian statistics. More repetition Tron Anders Moger
Correlation and regression Dr. Ghada Abo-Zaid
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Evaluating Diagnostic Accuracy of Prostate Cancer Using Bayesian Analysis Part of an Undergraduate Research course Chantal D. Larose.
Introduction to Nonparametric Statistics
What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.
© 2003 Pearson Prentice Hall Statistics for Business and Economics Nonparametric Statistics Chapter 14.
Chapter 14 Analysis of Categorical Data
Chapter 12 Chi-Square Tests and Nonparametric Tests
The Simple Regression Model
Results 2 (cont’d) c) Long term observational data on the duration of effective response Observational data on n=50 has EVSI = £867 d) Collect data on.
Final Review Session.
Hypothesis Testing. Introduction Always about a population parameter Attempt to prove (or disprove) some assumption Setup: alternate hypothesis: What.
Topic 3: Regression.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Chapter 14 Inferential Data Analysis
1 Overview of Major Statistical Tools UAPP 702 Research Methods for Urban & Public Policy Based on notes by Steven W. Peuquet, Ph.D.
Nonparametrics and goodness of fit Petter Mostad
Nonparametric or Distribution-free Tests
Inferential Statistics
SIMPLE LINEAR REGRESSION
Marketing Research, 2 nd Edition Alan T. Shao Copyright © 2002 by South-Western PPT-1 CHAPTER 17 BIVARIATE STATISTICS: NONPARAMETRIC TESTS.
Overview of Major Statistical Tools UAPP 702 Research Methods for Urban & Public Policy Based on notes by Steven W. Peuquet, Ph.D. 1.
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Statistics for clinical research An introductory course.
Statistical Decision Theory
Introduction: Why statistics? Petter Mostad
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
Research Methods in Human-Computer Interaction
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Analysis of variance Petter Mostad Comparing more than two groups Up to now we have studied situations with –One observation per object One.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
1 1 Slide Decision Theory Professor Ahmadi. 2 2 Slide Learning Objectives n Structuring the decision problem and decision trees n Types of decision making.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
© 2000 Prentice-Hall, Inc. Statistics Nonparametric Statistics Chapter 14.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Ordinally Scale Variables
Confidence intervals and hypothesis testing Petter Mostad
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Academic Research Academic Research Dr Kishor Bhanushali M
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Ch15: Decision Theory & Bayesian Inference 15.1: INTRO: We are back to some theoretical statistics: 1.Decision Theory –Make decisions in the presence of.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
Sampling and estimation Petter Mostad
Nonparametric Statistics
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Handbook for Health Care Research, Second Edition Chapter 13 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 13 Statistical Methods for Continuous Measures.
Analysis of variance Tron Anders Moger
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Inferential Statistics Assoc. Prof. Dr. Şehnaz Şahinkarakaş.
Non-parametric Tests Research II MSW PT Class 8. Key Terms Power of a test refers to the probability of rejecting a false null hypothesis (or detect a.
Y - Tests Type Based on Response and Measure Variable Data
SA3202 Statistical Methods for Social Sciences
Presentation transcript:

Decision theory and Bayesian statistics. Tests and problem solving Petter Mostad

Overview Statistical desicion theory Bayesian theory and research in health economics Review of tests we have learned about From problem to statistical test

Statistical decision theory Statistics in this course often focus on estimating parameters and testing hypotheses. The real issue is often how to choose between actions, so that the outcome is likely to be as good as possible, in situations with uncertainty In such situations, the interpretation of probability as describing uncertain knowledge (i.e., Bayesian probability) is central.

Decision theory: Setup The unknown future is classified into H possible states: s 1, s 2, …, s H. We can choose one of K actions: a 1, a 2, …, a K. For each combination of action i and state j, we get a ”payoff” (or opposite: ”loss”) M ij. To get the (simple) theory to work, all ”payoffs” must be measured on the same (monetary) scale. We would like to choose an action so to maximize the payoff. Each state s i has an associated probability p i.

Desicion theory: Concepts If action a 1 never can give a worse payoff, but may give a better payoff, than action a 2, then a 1 dominates a 2. a 2 is then inadmissible The maximin criterion The minimax regret criterion The expected monetary value criterion

Example No birdflu outbreak Small birdflu outbreak Birdflu pandemic No extra precautions Some extra precautions Vaccination of whole pop states actions

Decision trees Contains node (square junction) for each choice of action Contains node (circular junction) for each selection of states Generally contains several layers of choices and outcomes Can be used to illustrate decision theoretic computations Computations go from bottom to top of tree

Updating probabilities by aquired information To improve the predictions about the true states of the future, new information may be aquired, and used to update the probabilities, using Bayes theorem. If the resulting posterior probabilities give a different optimal action than the prior probabilities, then the value of that particular information equals the change in the expected monetary value But what is the expected value of new information, before we get it?

Example: Birdflu Prior probabilities: P(none)=95%, P(some)=4.5%, P(pandemic)=0.5%. Assume the probabilities are based on whether the virus has a low or high mutation rate. A scientific study can update the probabilities of the virus mutation rate. As a result, the probabilities for no birdflu, some birdflu, or a pandemic, are updated to posterior probabilities: We might get, for example:

Expected value of perfect information If we know the true (or future) state of nature, it is easy to choose optimal action, it will give a certain payoff For each state, find the difference between this payoff and the payoff under the action found using the expected value criterion The expectation of this difference, under the prior probabilities, is the expected value of perfect information

Expected value of sample information What is the expected value of obtaining updated probabilities using a sample? –Find the probability for each possible sample –For each possible sample, find the posterior probabilities for the states, the optimal action, and the difference in payoff compared to original optimal action –Find the expectation of this difference, using the probabilities of obtaining the different samples.

Utility When all outcomes are measured in monetary value, computations like those above are easy to implement and use Central problem: Translating all ”values” to the same scale In health economics: How do we translate different health outcomes, and different costs, to same scale? General concept: Utility Utility may be non-linear function of money value

Risk and (health) insurance When utility is rising slower than monetary value, we talk about risk aversion When utility is rising faster than monetary value, we talk about risk preference If you buy any insurance policy, you should expect to lose money in the long run But the negative utility of, say, an accident, more than outweigh the small negative utility of a policy payment.

Desicion theory and Bayesian theory in health economics research As health economics is often about making optimal desicions under uncertainty, decision theory is increasingly used. The central problem is to translate both costs and health results to the same scale: –All health results are translated into ”quality adjusted life years” –The ”price” for one ”quality adjusted life year” is a parameter called ”willingness to pay”.

Curves for probability of cost effectiveness given willingness to pay One widely used way of presenting a cost-effectiveness analysis is through the Cost- Effectiveness Acceptability Curve (CEAC) Introduced by van Hout et al (1994). For each value of the threshold willingness to pay λ, the CEAC plots the probability that one treatment is more cost-effective than another.

Review of tests Below is a listing of most of the statistical tests encountered in Newbold. It gives a grouping of the tests by application area For details, consult the book or previous notes!

One group of normally distributed observations Goal of test:Test statistic:Distribution: Testing mean of normal distribution, variance known standard normal: Testing mean of normal distribution, variance unknown t-fordelingen, n-1 frihetsgrader: Testing variance of normal population Chi-kvadrat, n-1 frihetsgrader

Comparing two groups of observations: matched pairs Assuming normal distributions, unknown variance: Compare means Sign test: Compare only which observations are largest S = the number of pairs with positive difference. Large samples (n>20): Wilcoxon signed rank test: Compare ranks and signs of differences T=min(T +,T - ); T + / T - are sum of positive/negative ranks Wilcoxon signed rank statistic (D 1, …, D n differences) Large samples:

Comparing two groups of observations: unmatched data Diff. between pop. means: Known variances Standard normal Diff. between pop. means: Unknown but equal variances Diff. between pop. means: Unknown and unequal variances Testing equality of variances for two normal populations Assuming identical translated distributions: test equal means: Mann Whitney U test Based on sum of ranks of obs. from one group; all obs. ranked together Standard normal (n>10) see book for d.f.

Comparing more than two groups of data One-way ANOVA: Testing if all groups are equal (norm.) Kruskal-Wallis test: Testing if all groups are equal Based on sums of ranks for each group; all obs. ranked together Two-way ANOVA: Testing if all groups are equal, when you also have blocking Two-way ANOVA with interaction: Testing if groups and blocking variable interact

Studying population proportions Test of population proportion in one group (large samples) Standard normal Comparing the population proportions in two groups (large samples) Standard normal (p 0 common estimate)

Regression tests Test of regression slope: Is it ? Test on partial regression coefficient: Is it ? Test on sets of partial regression coefficients: Can they all be set to zero (i.e., removed)?

Model tests Contingency table test: Test if there is an association between the two attributes in a contingency table Goodness-of-fit test: Counts in K categories, compared to expected counts, under H 0 Tests for normality: Bowman-Shelton Kolmogorov-Smirnov

Tests for correlation Test for zero population correlation (normal distribution) Test for zero correlation (nonparametric): Spearman rank correlation Compute ranks of x- values, and of y- values, and compute correlation of these ranks Special distribution

Tests for autocorrelation The Durbin-Watson test (based on normal assumption) testing for autocorrelation in regression data Special distribution The runs test (nonparametric), testing for randomness in time Counting the number of ”runs” above and below the median in the time series Special distribution, or standard normal for large samples

From problem to choice of method Example: You have the grades of a class of studends from this years statistics course, and from last years statistics course. How to analyze? You have measured the blood pressure, working habits, eating habits, and exercise level for 200 middleaged men. How to analyze?

From problem to choice of method Example: You have asked 100 married women how long they have been married, and how happy they are (on a specific scale) with their marriage. How to analyze? Example: You have data for how satisfied (on some scale) 50 patients are with their primary health care, from each of 5 regions of Norway. How to analyze?