11 Prior Distribution Elicitation for Generalized Linear and Piecewise-Linear Models Paul Garthwaite and Fadlalla Elfadaly Open University.

Slides:

Advertisements

Similar presentations

Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.

Advertisements

Design of Experiments Lecture I

ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.

Brief introduction on Logistic Regression

Comparing Two Proportions (p1 vs. p2)

Experiments and Variables

1 An Overview of Elicitation Methods and Software Paul Garthwaite Open University, UK (Joint work with Fadlalla Elfadaly)

Departments of Medicine and Biostatistics

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.

Assumption of normality

Model Assessment, Selection and Averaging

1) Introduction Prior to the Exxon Valdez oil spill, the estimation of passive use value, was an area of economic research not well known. However, based.

What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.

ANALYSIS OF REPEATED MEASURES COST DATA WITH ZERO OBSERVATIONS: An Application To The Costs Associated With Inflammatory Polyarthritis Nicola J Cooper,

Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.

Maximum likelihood estimates What are they and why do we care? Relationship to AIC and other model selection criteria.

Parameterising Bayesian Networks: A Case Study in Ecological Risk Assessment Carmel A. Pollino Water Studies Centre Monash University Owen Woodberry, Ann.

1 Quantifying Opinion about a Logistic Regression using Interactive Graphics Paul Garthwaite The Open University Joint work with Shafeeqah Al-Awadhi.

Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review

Clustered or Multilevel Data

(Correlation and) (Multiple) Regression Friday 5 th March (and Logistic Regression too!)

Modeling Gene Interactions in Disease CS 686 Bioinformatics.

Oscar Go, Areti Manola, Jyh-Ming Shoung and Stan Altan

Assumption of linearity

Analysis of Complex Survey Data

Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.

SAS Lecture 5 – Some regression procedures Aidan McDermott, April 25, 2005.

1 Formal Evaluation Techniques Chapter 7. 2 test set error rates, confusion matrices, lift charts Focusing on formal evaluation methods for supervised.

Inference for regression - Simple linear regression

Multiple Choice Questions for discussion

Modeling Menstrual Cycle Length in Pre- and Peri-Menopausal Women Michael Elliott Xiaobi Huang Sioban Harlow University of Michigan School of Public Health.

Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.

PTP 560 Research Methods Week 8 Thomas Ruediger, PT.

Fundamentals of Data Analysis Lecture 10 Management of data sets and improving the precision of measurement pt. 2.

Practical Statistical Analysis Objectives: Conceptually understand the following for both linear and nonlinear models: 1.Best fit to model parameters 2.Experimental.

Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.

Planning and Data Collection

University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.

LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.

Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.

MARE 250 Dr. Jason Turner Multiple Regression. y Linear Regression y = b 0 + b 1 x y = dependent variable b 0 + b 1 = are constants b 0 = y intercept.

4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.

Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.

Lecture 5 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.

The two way frequency table The  2 statistic Techniques for examining dependence amongst two categorical variables.

Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.

Question paper 1997.

Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.

Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.

A generalized bivariate Bernoulli model with covariate dependence Fan Zhang.

Assessing Responsiveness of Health Measurements Ian McDowell, INTA, Santiago, March 20, 2001.

1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.

Advanced Residual Analysis Techniques for Model Selection A.Murari 1, D.Mazon 2, J.Vega 3, P.Gaudio 4, M.Gelfusa 4, A.Grognu 5, I.Lupelli 4, M.Odstrcil.

Designing Factorial Experiments with Binary Response Tel-Aviv University Faculty of Exact Sciences Department of Statistics and Operations Research Hovav.

Tutorial I: Missing Value Analysis

Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.

Assumptions of Multiple Regression 1. Form of Relationship: –linear vs nonlinear –Main effects vs interaction effects 2. All relevant variables present.

Nonlinear Logistic Regression of Susceptibility to Windthrow Seminar 7 Likelihood Methods in Forest Ecology October 9 th – 20 th, 2006.

REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.

Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.

Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.

Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.

Uncertain Judgements: Eliciting experts’ probabilities Anthony O’Hagan et al 2006 Review by Samu Mäntyniemi.

Methods of Presenting and Interpreting Information Class 9.

Prior beliefs Prior belief is knowledge that one has about a parameter of interest before any events have been observed – For example, you may have an.

Fixed, Random and Mixed effects

New Techniques and Technologies for Statistics 2017 Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.

Clinical prediction models

Presentation transcript:

11 Prior Distribution Elicitation for Generalized Linear and Piecewise-Linear Models Paul Garthwaite and Fadlalla Elfadaly Open University

22 Why piecewise-linear models? Initial motivation for this model came from the need to model ecologists’ opinion about the presence/absence of rare and endangered animals. (A good example where expert opinion is useful – the ecologists had sightings of rare species but the data was not from a sampling frame and hence hard to incorporate in a statistical analysis.) For most variables there was an optimum value for a species. E.g. too hot or too cold did not suit it; nor too wet or too dry, etc.

3

4 Sampling Model Logistic model: y = ln( p/(1-p)) = β 0 + β 1 x 1 + …+ β k x k. GLM: y = g(μ) = β 0 + β 1 x 1 + …+ β k x k. Strategy: Elicit quantiles of p or μ and transform the assessments to quantiles of y. Prior model: β ~ multivariate normal. Three software implementations of the method: Garthwaite (1998: Visual Basic) Kynn (2004: Pascal. Elicitor) Elfadaly, Jenkinson, Garthwaite and Laney (2007/9: JAVA) (The programs of Garthwaite and Kynn only handle logistic regression.)

55 Assessments at reference point Scene-setting questions determine the number of variables and factors, their ranges and also a reference point. The reference point is chosen by the expert and gives the origin of variables and the reference level of factors. For a continuous variable it is assumed that opinion about slopes on one side of the reference point is independent of opinion about slopes on the other side. With the methods of Garthwaite and Elfadaly et al., median, lower and upper quartiles of the response at the reference point are assessed.

66 Lower and upper quartiles have the advantage that they can be assessed by the method of bisection. L M U 25% 25% 25% 25% _________________________________

7 Elicitor is much more flexible. For assessing the median, some techniques that can be used with logistic regression are available to the expert: Visual aids such as a probability wheel can be used. Probabilities can be given by first stating a (large) sample size and then assessing the number in that sample with the characteristic of interest. Scales marked in odds or log-odds can also be used. For credible intervals, intervals other than 50% intervals can be specified and a form of fixed interval method is also advocated.

88 Median Assessments Medians are assessed for one covariate at a time. The expert is asked to assume that all other covariates are at their reference values and to consider how the response varies with the covariate of current interest. The expert clicks on a graph to draw a curve for covariates or a bar chart for factors. (This is a poor approach to designing experiments but has clear benefits when eliciting expert opinion.)

9

10 The number of knots does not seem crucial. Elicitor gives the option of fitting a linear or quadratic function to the medians. Garthwaite (1998) gave option of superimposing graphs to help improve the expert’s internal consistency across covariates. (In forming models we almost always adopt linear relationships as the building blocks. Elicited piecewise linear relationships could instead be used as the building blocks.)

11

12 Feedback Feedback is generally beneficial. Useful to display the median estimate at other design points, other than those points where all but one of the covariates are at their reference values. Mason (2008) used Elicitor to question an expert about non-random non-response in a longitudinal survey. Reference point was for best response-rate. Worst case setting of the covariates gave a response rate of only 1%. The expert revised his median assessments and the worst- case response-rate increased to 9%, which the expert still thought was too low. The response-rate rapidly diminishes as probabilities are multiplied. Intend adding this feedback option to the software.

13

14

15

16

17

18

19 Examples O’Leary et al. (2009a) give an example where two experts assessed the probability of presence/absence for the brush-tailed rock- wallaby using Elicitor. Only two covariates: (i) Aspect (northerly vs other) (ii) Slope (0 o - 90 o ). O’Leary et al. (2009b) also gives an example where presence/absence for this wallaby is assessed – this time by only one expert but using four different methods, with aspect as the only covariate. Data: presence at 41 sites and absence at 9 (rare species? pest?)

20 Assessments of the two experts (O’Leary et al., 2009a)

Classification rates of four methods (O’Leary et al., 2009b) MethodPredictedObserved present absent Elicitorpresent419 absent00 Map-methodpresent01 absent418 Questionnairepresent419 absent00 Classification tree present351 absent68

22 Kynn (2004) gives five case studies conducted during the development of Elicitor where ecologists used it to quantify their opinions about an endangered species. Two of the studies had sample data with which to evaluate models. Ground parrot 137 presences and 438 pseudo-absences. 80% of the data was used to fit models and 20% for testing. Two continuous covariates, a factor with three levels and a second factor with four levels. Three models were considered: (a)Assessed prior + data (b)“Relaxed” prior + data (relaxed: variances were multiplied by 10) (c) Classical logistic stepwise regression.

23 Classification rates for ground parrot (Kynn, 2005) Stepwise does best – presumably variable selection helps. It used just the two continuous variables. MethodPredictedObserved present absent Assessed prior + data present2816 absent148 Relaxed prior + data present2211 absent753 Frequentist stepwise present2710 absent254

24 Criteria for threshold: minimise

25 Stemmacantha (a thistle) 203 presences and 2741 absences. Same three models; 80% of the data for fitting & 20% for testing. Stemmacantha Ground Parrat

26 Classification rates for Stemmacantha (Kynn, 2005) Numbers are inconsistent, but there seems little to choose between the methods. MethodPredictedObserved present absent Assessed prior + data present3377 absent11457 Relaxed prior + data present3483 absent6461 Frequentist stepwise present3269 absent15468

27 Garthwaite (1998) and Garthwaite & Al-Awadhi (2006) also quantify the opinion of ecologists about rare species in Queensland. Central Government wanted State Government to estimate habitat distribution of rare and endangered species. Some sample data were gathered. The aim was to link the data, ecologists’ knowledge and a GIS database to relate the probability of presence/absence to a large number of covariates. Preliminary meeting with about a dozen ecologists indicated that nonlinear relationships were needed to model their opinion (hence the piecewise linear models). Little bent-wing bat. (5 variables and 8 factors, giving 57 regression coefficients. Data: 42 presences in 375 sites.)

28 Plumed frogmouth. (7 variables, 3 factors; 58 parameters). Data: 31 presences in 324 sites. Powerful owl. (1 variable, 5 factors; 24 parameters). Data: 13 presences in 324 sites. Greater glider. (7 variables, 4 factors; 60 parameters). Data: 53 presences in 343 sites. Common bent-wing bat. (4 variables, 7 factors; 59 parameters). Data: 13 presences in 375 sites.

29 Various prior distributions were fitted to compensate for systematic biases in the expert’s assessments. 1.(β 0, β 1,…, β k ) multivariate normal. 2.β 0 diffuse, (β 1,…, β k ) ~ MVN(b, Σ). 3.θ, β 0 diffuse, (β 1,…, β k ) ~ MVN(θb, θ 2 Σ). 4.γ, θ, β 0 diffuse, (β 1,…, β k ) ~ MVN(θb, γΣ). Cross-validation: Repeatedly using 80% of the data for fitting and 20% for testing. Squared error loss was used to measure performance.

30 Little bent- wing bat Common b-w bat Plumed frogmouth Powerful Owl Greater glider Prior Prior Prior Prior Stepwise logistic Regression Prior: no data Prior 3 (constant term given diffuse prior and all coefficients multiplied by a constant) is the best for each animal – noticeably better for the plumed frogmouth. The prior with no data is comparable with stepwise regression except for the greater glider. There is quite limited data.

31 A second example: Air pollution in (Khaldiya) Kuwait City Khaldiya had a mobile laboratory station to monitor pollution for one year. Focus is on the probability of pollutants exceeding harmful threshold level. There are two permanent fixed laboratory stations: 5 km north-east and 5 km south-west of Khaldiya. Aim is to use the data and the opinion of two scientists to relate Khaldiya pollution to the permanent laboratories. Pollutants: SO 2, NO 2 and n-CH 4 (non-methane). Scientists quantified their opinions separately. Variables: pollution levels at the permanent labs, temperature, wind speed, humidity, height of the inversion line.

32 Expert A/ SO 2 Expert A/ NO 2 Expert B/ NO 2 Expert A/ n-CH 4 Expert B/ n-CH 4 Prior Prior Prior Prior Stepwise logistic Regression Prior: no data Non-methane: priors seem poor as priors + no data do much worse than other methods; stepwise logistic does better than using expert B’s prior but not expert A’s, especially with Prior 2. For SO 2 and NO 2, the prior’s seem better and prior + data does better than stepwise logistic regression. Prior 2 is perhaps the best.

33 (Not Kuwait City)

34 A medical application The UK National Health Service (NHS) initiated a study to estimate the benefits of current bowel cancer services in England and examine costs and benefits of alternative developments in service provision. ScHARR developed a treatment pathway model that gave the possible sequences of presentation, diagnosis, treatment and outcomes that could be followed by a patient with suspected colorectal (bowel) cancer. Available information supplied most of the required numbers but expert opinion filled in gaps. The resulting report states, “Owing to a lack of empirical evidence in a number of areas, several of the model parameter and details of the model structure were elicited from experts.”

35 For two quantities there were covariates. For these, the new version of the software was used to quantify consultants’ opinions. Choice of diagnostic test had level of fitness as a covariate. Choice of adjuvant chemotherapy had five covariates (mostly factors): age, tumor location, disease status, perforation/obstruction, and fitness for cytotoxic therapy. Results were validated where possible. Commenting on assessments about adjuvant chemotherapy the YHEC- ScHARR report notes that “The [pathways] model uses expert 1’s responses as part of a generalised linear model and is validated by expert 2’s responses.” The use of elicitation in the study is reported in Garthwaite, Chilcott, Jenkinson & Tappenden (2008).

36 Al-Awadhi & Garthwaite (2006). Computational statistics, 21, Garthwaite (1998). Quantifying expert opinion for modelling habitat distributions. Sustainable Forest Management Tech. Report, Queensland Depart. Natural Resources. Garthwaite & Al-Awadhi (2006). Tech. Report 06/07. Dept. Statistics, Open University. Garthwaite, Chilcott, Jenkinson & Tappenden (2008). Int. J. Technology assessment in Health Care, 24, Kynn (2005). Eliciting expert knowledge for Bayesian logistic regression in species habitat modelling in natural resources. PhD thesis. Queensland University of Technology. Mason (2008). Methodological developments for combining data. O’Leary, Choy, Kynn, Denham, Martin, Mengersem & Murray (2009a). Environmetrics, 20, O’Leary, Mengersem, Murray & Choy (2009b). Comparison of four expert elicitation methods. 18 th World IMACS/MODSIM Congress.

37

38