Crystal Linkletter and Derek Bingham Department of Statistics and Actuarial Science Simon Fraser University Acknowledgements This research was initiated.

Slides:



Advertisements
Similar presentations
Rachel T. Johnson Douglas C. Montgomery Bradley Jones
Advertisements

Yinyin Yuan and Chang-Tsun Li Computer Science Department
Insert Date HereSlide 1 Using Derivative and Integral Information in the Statistical Analysis of Computer Models Gemma Stephenson March 2007.
Running a model's adjoint to obtain derivatives, while more efficient and accurate than other methods, such as the finite difference method, is a computationally.
Introduction to Design of Experiments by Dr Brad Morantz
Design Rule Generation for Interconnect Matching Andrew B. Kahng and Rasit Onur Topaloglu {abk | rtopalog University of California, San Diego.
A.M. Alonso, C. García-Martos, J. Rodríguez, M. J. Sánchez Seasonal dynamic factor model and bootstrap inference: Application to electricity market forecasting.
Simon Fraser University Department of Statistics and Actuarial Sciences Some Random Questions.
Teaching Statistics Using Stata Software Susan Hailpern BSN MPH MS Department of Epidemiology and Population Health Albert Einstein College of Medicine.
Fast Bayesian Matching Pursuit Presenter: Changchun Zhang ECE / CMR Tennessee Technological University November 12, 2010 Reading Group (Authors: Philip.
Bayesian Nonparametric Matrix Factorization for Recorded Music Reading Group Presenter: Shujie Hou Cognitive Radio Institute Friday, October 15, 2010 Authors:
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Visual Recognition Tutorial
Cost-Sensitive Classifier Evaluation Robert Holte Computing Science Dept. University of Alberta Co-author Chris Drummond IIT, National Research Council,
Screening Experiments for Developing Dynamic Treatment Regimes S.A. Murphy At ICSPRAR January, 2008.
Screening Experiments for Dynamic Treatment Regimes S.A. Murphy At ENAR March, 2008.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan Florida: January, 2006.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan April, 2006.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan January, 2006.
Value of Information for Complex Economic Models Jeremy Oakley Department of Probability and Statistics, University of Sheffield. Paper available from.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Handouts Software Testing and Quality Assurance Theory and Practice Chapter 9 Functional Testing
Monte Carlo Simulation 1.  Simulations where random values are used but the explicit passage of time is not modeled Static simulation  Introduction.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Estimating the delay of the fMRI response C.H. Liao 1, K.J. Worsley 12, J-B. Poline 3, G.H. Duncan 4, A.C. Evans 2 1 Department of Mathematics.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Quantitative Skills 1: Graphing
Bayesian Sets Zoubin Ghahramani and Kathertine A. Heller NIPS 2005 Presented by Qi An Mar. 17 th, 2006.
Modeling and Validation of a Large Scale, Multiphase Carbon Capture System William A. Lane a, Kelsey R. Bilsback b, Emily M. Ryan a a Department of Mechanical.
SPM Course Zurich, February 2015 Group Analyses Guillaume Flandin Wellcome Trust Centre for Neuroimaging University College London With many thanks to.
Using Resampling Techniques to Measure the Effectiveness of Providers in Workers’ Compensation Insurance David Speights Senior Research Statistician HNC.
Discrete Distributions The values generated for a random variable must be from a finite distinct set of individual values. For example, based on past observations,
Yaomin Jin Design of Experiments Morris Method.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Receptor Occupancy estimation by using Bayesian varying coefficient model Young researcher day 21 September 2007 Astrid Jullion Philippe Lambert François.
Corinne Introduction/Overview & Examples (behavioral) Giorgia functional Brain Imaging Examples, Fixed Effects Analysis vs. Random Effects Analysis Models.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Multifactor GPs Suppose now we wish to model different mappings for different styles. We will add a latent style vector s along with x, and define the.
STA 216 Generalized Linear Models Meets: 2:50-4:05 T/TH (Old Chem 025) Instructor: David Dunson 219A Old Chemistry, Teaching.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Categorical Independent Variables STA302 Fall 2013.
July 11, 2006Bayesian Inference and Maximum Entropy Probing the covariance matrix Kenneth M. Hanson T-16, Nuclear Physics; Theoretical Division Los.
CHAPTER 17 O PTIMAL D ESIGN FOR E XPERIMENTAL I NPUTS Organization of chapter in ISSO –Background Motivation Finite sample and asymptotic (continuous)
A generalized bivariate Bernoulli model with covariate dependence Fan Zhang.
Additional Topics in Prediction Methodology. Introduction Predictive distribution for random variable Y 0 is meant to capture all the information about.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
by Ryan P. Adams, Iain Murray, and David J.C. MacKay (ICML 2009)
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
Designing Factorial Experiments with Binary Response Tel-Aviv University Faculty of Exact Sciences Department of Statistics and Operations Research Hovav.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
SAMSI March 2007 GASP Models and Bayesian Regression David M. Steinberg Dizza Bursztyn Tel Aviv University Ashkelon College.
Gaussian Process Networks Nir Friedman and Iftach Nachman UAI-2K.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author: Lynette.
Dario Grana and Tapan Mukerji Sequential approach to Bayesian linear inverse problems in reservoir modeling using Gaussian mixture models SCRF Annual Meeting,
Non-parametric Methods for Clustering Continuous and Categorical Data Steven X. Wang Dept. of Math. and Stat. York University May 13, 2010.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Gaussian Mixture Model classification of Multi-Color Fluorescence In Situ Hybridization (M-FISH) Images Amin Fazel 2006 Department of Computer Science.
Uncertainty Quantification and Bayesian Model Averaging
Bayesian Semi-Parametric Multiple Shrinkage
BINARY LOGISTIC REGRESSION
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
STA 216 Generalized Linear Models
Variable Selection for Gaussian Process Models in Computer Experiments
STA 216 Generalized Linear Models
Comparisons among methods to analyze clustered multivariate biomarker predictors of a single binary outcome Xiaoying Yu, PhD Department of Preventive Medicine.
Multivariate Methods Berlin Chen
Facultad de Ingeniería, Centro de Cálculo
Presentation transcript:

Crystal Linkletter and Derek Bingham Department of Statistics and Actuarial Science Simon Fraser University Acknowledgements This research was initiated while Linkletter, Bingham and Ye were visiting the Statistical Sciences group at Los Alamos National Laboratory. This work was supported by a grant from the Natural Sciences and Engineering Research Council of Canada. Ye’s research supported by NSD DMS Conclusions and Future Research RDVS is a new method for variable selection for Bayesian Gaussian Spatial Process models. The methodology is motivated by asking: what would the posterior distribution of the correlation parameter for an inert factor look like given the data? The approach is Bayesian and only requires the generation of an inert factor, but the screening has a frequentist flavour, using the distribution of the inert factor as a reference distribution. Future research: Using a linear regression model for the mean of the GASP model Using RDVS for variable selection for other models. Computer Experiment Example Taylor Cylinder Experiment (Los Alamos National Lab) This is a finite element code used to simulate the high velocity impact of a cylinder. In the experiment, copper cylinders (length 5.08 cm, radius 1 cm) are fired into a fixed barrier at a velocity of 177 m/s. The cylinder length after impact is used as the outcome. The process is governed by 14 parameters which control the behaviour of the cylinder after impact. Over the limited range that the computer experiment exercises the simulator, it is expected that the response is dominated by only a few of the 14 parameters. Introduction Computer simulators often require a large number of inputs and are computationally demanding. A main goal of computer experimentation may be screening, identifying which inputs have a significant impact on the process being studied. Gaussian spatial process (GASP) models are commonly used to model computer simulators. These models are flexible, but make variable selection challenging. We present reference distribution variable selection (RDVS) as a new approach to screening for GASP models. Gaussian Spatial Process Model To model the response from a computer experiment, we use a Bayesian version of the GASP model originally used by Sacks et al. (1989): y(X): Simulator response – (n x 1) vector X: Input to the computer code – (n x p) design matrix  : White-noise process, independent of z(X) The Gaussian spatial process, z(X), is specified to have mean zero and covariance function Under this parameterization, if  k is close to one, the k th input is not active. RDVS is a method for gauging the relative magnitudes of the correlation parameters  k. Results Simulated Example We used a 54-run space-filling Latin hypercube design with p=10 factors. The response is generated by: A GASP model is used to analyse the generated response and the RDVS algorithm is used to identify the first four factors as active: Posterior distributions for correlation parameters of 10 factors. The horizontal line marks the 10 th percentile of the reference distribution. Correlation parameters with posterior medians below this line indicate active factors. Taylor Cylinder Experiment A 118-run 5-level nearly-orthogonal design was used. Exploratory analysis suggests factor 6 is important, otherwise significant factors are difficult to identify: RDVS identifies factor 6 and six other factors as having a significant impact on cylinder deformation. Discussion RDVS is able to correctly identify when none of the true factors are active. This variable selection technique complements methods in sensitivity analysis. It can be used as a precursor to alternative visualization and ANOVA approaches to screening. The method is robust to the specification of the prior distributions. Since the inert variable is assigned the same prior as the true factors, the method self-calibrates. Variable Selection for Gaussian Process Models in Computer Experiments RDVS Algorithm To implement RDVS, a factor which is known to be inert is appended to the design matrix X. This provides a benchmark against which the other input factors can be compared. Algorithm 1.Augment the design matrix by adding a new design column corresponding to an inert factor. 2.Find the posterior median of the correlation parameter corresponding to the dummy factor. 3.Repeat steps 1. and 2. many times to obtain the distribution of the posterior median of an inert factor to use as a reference distribution. 4.Compare the posterior medians of the correlation parameters of the true factors to the reference distribution. The percentile of the reference distribution used for comparison reflects the rate of falsely identifying an inert factor as active. David Higdon and Nick Hengartner Statistical Sciences Discrete Event Simulations Los Alamos National Laboratory Kenny Q. Ye Department of Epidemiology and Population Health Albert Einstein College of Medicine