7 November The 2003 CHEBS Seminar 1 The problem with costs Tony O’Hagan CHEBS, University of Sheffield.

Slides:



Advertisements
Similar presentations
Regression Eric Feigelson Lecture and R tutorial Arcetri Observatory April 2014.
Advertisements

Objectives 10.1 Simple linear regression
Bayesian Health Technology Assessment: An Industry Statistician's Perspective John Stevens AstraZeneca R&D Charnwood Bayesian Statistics Focus Team Leader.
Significance Tests Hypothesis - Statement Regarding a Characteristic of a Variable or set of variables. Corresponds to population(s) –Majority of registered.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
Introduction to Statistics
Confidence intervals. Population mean Assumption: sample from normal distribution.
Estimating the Population Mean Assumptions 1.The sample is a simple random sample 2.The value of the population standard deviation (σ) is known 3.Either.
1 Equivalence and Bioequivalence: Frequentist and Bayesian views on sample size Mike Campbell ScHARR CHEBS FOCUS fortnight 1/04/03.
Value of Information Some introductory remarks by Tony O’Hagan.
Statistical Background
17/12/2002 CHEBS Launch Seminar CHEBS Activities and Plans Tony O’Hagan Director.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Sampling Distributions
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Chapter 11: Inference for Distributions
Costs Some introductory remarks by Tony O’Hagan. Welcome! Welcome to the fourth CHEBS dissemination workshop This forms part of our Focus Fortnight on.
Modelling health care costs: practical examples and applications Andrew Briggs Philip Clarke University of Oxford & Daniel Polsky Henry Glick University.
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
1 2. Reliability measures Objectives: Learn how to quantify reliability of a system Understand and learn how to compute the following measures –Reliability.
Overview of Robust Methods Analysis Jinxia Ma November 7, 2013.
Standard Error of the Mean
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
AM Recitation 2/10/11.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Sampling Theory Determining the distribution of Sample statistics.
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.
Estimates and Sample Sizes Lecture – 7.4
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
The COTOR Challenge Committee on the Theory of Risk November 2004 Annual Meeting.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Chapter 13 – Difference Between Two Parameters Math 22 Introductory Statistics.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Fall 2002Biostat Nonparametric Tests Nonparametric tests are useful when normality or the CLT can not be used. Nonparametric tests base inference.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
The Robust Approach Dealing with real data. Estimating Population Parameters Four properties are considered desirable in a population estimator:  Sufficiency.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Estimating a Population Mean
On Predictive Modeling for Claim Severity Paper in Spring 2005 CAS Forum Glenn Meyers ISO Innovative Analytics Predictive Modeling Seminar September 19,
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Simple examples of the Bayesian approach For proportions and means.
Sampling and estimation Petter Mostad
Estimating a Population Mean. Student’s t-Distribution.
1 CSI5388 Practical Recommendations. 2 Context for our Recommendations I This discussion will take place in the context of the following three questions:
Essential Statistics Chapter 171 Two-Sample Problems.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
In your groups, go through all four steps of a confidence interval: The “Country Taste” bread making company wants to estimate the actual weight of their.
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Estimating standard error using bootstrap
Inferential Statistics Inferences from Two Samples
Math 4030 – 10a Tests for Population Mean(s)
Bayesian Inference for Small Population Longevity Risk Modelling
CHAPTER 21: Comparing Two Means
Problems: Q&A chapter 6, problems Chapter 6:
Elementary Statistics
Lesson Comparing Two Means.
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Inference for Distributions
PSY 626: Bayesian Statistics for Psychological Science
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
Essential Statistics Two-Sample Problems - Two-sample t procedures -
Estimates and Sample Sizes Lecture – 7.4
Introduction to the t Test
Chapter 8 Inferences from Two Samples
CONFIDENCE INTERVALS A confidence interval is an interval in which we are certain to a given probability of the mean lying in that interval. For example.
Presentation transcript:

7 November The 2003 CHEBS Seminar 1 The problem with costs Tony O’Hagan CHEBS, University of Sheffield

7 NovemberThe 2003 CHEBS Seminar2 A simple problem Given a sample from a population, how can we estimate the mean of that population? Sample mean? ›Unbiased and consistent ›But sensitive to extreme observations In Health Economics ›Costs are invariably very skewed ›Can also arise with times to events and other kinds of data ›And we really require inference about population means

7 NovemberThe 2003 CHEBS Seminar3 A simple dataset Costs incurred by 26 asthma patients in a trial comparing two inhalers ›Patients who used pMDI and had no exacerbations

7 NovemberThe 2003 CHEBS Seminar4 Some estimates & intervals Sample mean (use CLT to justify normality) ›Estimate 2104, 95% CI (-411,4619) Bayesian analysis assuming normality (weak prior) ›Posterior mean 2104, 95% credible interval (-411,4619) Nonparametric bootstrap ›Estimate 2104, 95% CI (298,4785) Bayesian bootstrap ›Posterior mean 2104, 95% credible interval (575,5049) Bayesian analysis assuming lognormality ›Posterior median 1112, 95% credible interval (510,3150)

7 NovemberThe 2003 CHEBS Seminar5 Which is right? 1.Data appear to fit lognormal much better than normal ›But many other distributions might visually fit well yet give completely different results 2.Results from analysis assuming normality are supported by bootstrap and by analysis based on CLT ›These are well known to be robust methods ›But bootstrapping the sample mean will always give the same estimate and will tend to back up the normal-theory analysis ›And extreme skewness evident in the population suggests non- robustness of the sample mean 3.Real cost distributions won’t follow any standard form

7 NovemberThe 2003 CHEBS Seminar6 The problem The population mean depends critically on the shape of the tail How can we learn about that tail from a small sample? Or even quite a large one?

7 NovemberThe 2003 CHEBS Seminar7 Bayesian model comparison Bayes factors for the example data ›Lognormal versus normal, ›Lognormal versus square-root normal, ›Lognormal is favoured over any other power transformation to normality ›Lognormal versus gamma, 10 3 This is far from conclusive ›Distributions we can’t distinguish could still have completely different tails

7 NovemberThe 2003 CHEBS Seminar8 Possible distributions Normal – unrealistic, very thin tailed Gamma – thin tailed (exponential) ›Sample mean is MVUE Lognormal – heavier tailed ›Population mean exists but its posterior mean may not Inverse gamma – heavy tailed (polynomial) ›Population mean exists if enough degrees of freedom

7 NovemberThe 2003 CHEBS Seminar9 Log-gamma, log-logistic – too heavy tailed? ›Population mean never exists Generalised Pareto – range of tail weights ›Used in extreme value theory Mixtures and chimeras ›More flexible and realistic ›Harder to fit ›Bayesian methods essential

7 NovemberThe 2003 CHEBS Seminar10 More complex structures We nearly always wish to compare means ›Extreme data can heavily influence comparison ›Asthma dataset We also often need to model costs in more complex ways ›Components of costs ›Covariates ›Tail shape can again be very influential

7 NovemberThe 2003 CHEBS Seminar11

7 NovemberThe 2003 CHEBS Seminar12 Recommendations Try a variety of models ›If sample size is large enough, answers may be robust to modelling assumptions Use prior information ›We need evidence of what kinds of distributions can arise in different situations ›And of how different they can be between different groups