Parametric modelling of cost data: some simulation evidence Andrew Briggs University of Oxford Richard Nixon MRC Biostatistics Unit, Cambridge Simon Dixon.

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Statistical Estimation and Sampling Distributions
What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.
Chapter 10: Sampling and Sampling Distributions
Overview Revising z-scores Revising z-scores Interpreting z-scores Interpreting z-scores Dangers of z-scores Dangers of z-scores Sample and populations.
Jump to first page STATISTICAL INFERENCE Statistical Inference uses sample data and statistical procedures to: n Estimate population parameters; or n Test.
Sociology 601: Class 5, September 15, 2009
Chapter Six Sampling Distributions McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Statistical inference
Topic 2: Statistical Concepts and Market Returns
Lesson #17 Sampling Distributions. The mean of a sampling distribution is called the expected value of the statistic. The standard deviation of a sampling.
2. Point and interval estimation Introduction Properties of estimators Finite sample size Asymptotic properties Construction methods Method of moments.
Chapter 7: Variation in repeated samples – Sampling distributions
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
Modelling health care costs: practical examples and applications Andrew Briggs Philip Clarke University of Oxford & Daniel Polsky Henry Glick University.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Statistical Theory; Why is the Gaussian Distribution so popular? Rob Nicholls MRC LMB Statistics Course 2014.
SAMPLING DISTRIBUTION
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Topic 5 Statistical inference: point and interval estimate
1 SAMPLE MEAN and its distribution. 2 CENTRAL LIMIT THEOREM: If sufficiently large sample is taken from population with any distribution with mean  and.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Random Sampling, Point Estimation and Maximum Likelihood.
Advanced Higher Statistics Data Analysis and Modelling Hypothesis Testing Statistical Inference AH.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
Chapter 7 Point Estimation
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Sampling Distribution and the Central Limit Theorem.
6 - 1 © 1998 Prentice-Hall, Inc. Chapter 6 Sampling Distributions.
Section 6-5 The Central Limit Theorem. THE CENTRAL LIMIT THEOREM Given: 1.The random variable x has a distribution (which may or may not be normal) with.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Experimental Design Experimental Designs An Overview.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
LECTURE 25 THURSDAY, 19 NOVEMBER STA291 Fall
HYPOTHESIS TESTING Distributions(continued); Maximum Likelihood; Parametric hypothesis tests (chi-squared goodness of fit, t-test, F-test) LECTURE 2 Supplementary.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
1 Sampling Distribution of Arithmetic Mean Dr. T. T. Kachwala.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Lesson Distribution of the Sample Mean. Objectives Understand the concept of a sampling distribution Describe the distribution of the sample mean.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
6 - 1 © 2000 Prentice-Hall, Inc. Statistics for Business and Economics Sampling Distributions Chapter 6.
Sampling Distributions
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Selecting Input Probability Distributions. 2 Introduction Part of modeling—what input probability distributions to use as input to simulation for: –Interarrival.
Chapter 4. The Normality Assumption: CLassical Normal Linear Regression Model (CNLRM)
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Stat 223 Introduction to the Theory of Statistics
Advanced Higher Statistics
Stat 223 Introduction to the Theory of Statistics
LECTURE 24 TUESDAY, 17 November
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Simon Thompson University of Cambridge
STAT 5372: Experimental Statistics
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Review of Statistical Inference
CHAPTER 15 SUMMARY Chapter Specifics
Sampling Distribution Models
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
How Confident Are You?.
Presentation transcript:

Parametric modelling of cost data: some simulation evidence Andrew Briggs University of Oxford Richard Nixon MRC Biostatistics Unit, Cambridge Simon Dixon University of Sheffield Simon Thompson MRC Biostatistics Unit, Cambridge 2003 CHEBS Seminar, Friday 7 th November

Parametric modelling of cost data: Background Cost data are typically non-normally distributed, with high skew and kurtosis Arithmetic mean cost is of interest to policy makers Central Limit Theorem ensures sample mean is consistent estimator Commentators have proposed parametric modelling of cost data to improve efficiency In particular, Lognormal distribution commonly advocated Alternatively, Gamma distribution is an increasingly popular choice

Parametric modelling of cost data: Choice of estimator If data are Lognormal an efficient estimator of mean cost is: exp(lm+lv/2) If data are Gamma distributed the maximum likelihood estimate of the population mean is the sample mean

Parametric distributions: Simulation experiment Lognormal / Gamma distributions Population mean was set to be 1000 Five choices of coefficient of variation (CoV = 0.25, 0.5, 1.0, 1.5, 2.0) to define distribution parameters Samples of five different sizes (n = 20, 50, 200, 500, 2000) drawn from each distribution for each CoV 2 x 5 x 5 = 50 experiments Bias, coverage probability and RMSE all recorded

Parametric distributions: Distribution sets

Parametric distributions: Estimated RMSE from simulations

Parametric distributions: Estimated coverage probabilities

Empirical cost distributions: Summary statistics for 3 data sets Raw cost Log transformed cost

Empirical cost distributions: Data set 1: CPOU Raw cost Log transformed cost

Empirical cost distributions: Data set 2: IV Fluids Raw cost Log transformed cost

Empirical cost distributions: Data set 3: Paramedics Raw cost Log transformed cost

Empirical cost data sets: Simulation results

Parametric cost modelling: Comments & conclusions “All models are wrong” (Box 1976) “No data are normally distributed” (Nester 1996) Costs are estimated from resource use times unit cost Any parametric assumption relating to costs is at best an approximation Simulations confirm that there are efficiency gains if appropriate distribution is chosen But incorrect assumptions can lead to very misleading conclusions Sample mean performs well and is unlikely to lead to inappropriate inference Only when there are sufficient data to permit detailed modelling is the choice of an alternative estimator warrented