A Primer on Running Deterministic Experiments

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Rachel T. Johnson Douglas C. Montgomery Bradley Jones
Introduction to Design of Experiments by Dr Brad Morantz
Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Designing Ensembles for Climate Prediction
Pattern Recognition and Machine Learning
Kriging.
Experimental Design, Response Surface Analysis, and Optimization
Cost of surrogates In linear regression, the process of fitting involves solving a set of linear equations once. For moving least squares, we need to.
Ch11 Curve Fitting Dr. Deshi Ye
Prénom Nom Document Analysis: Parameter Estimation for Pattern Recognition Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Lecture 17 Today: Start Chapter 9 Next day: More of Chapter 9.
Linear and generalised linear models
General Mining Issues a.j.m.m. (ton) weijters Overfitting Noise and Overfitting Quality of mined models (some figures are based on the ML-introduction.
CSC2535: 2013 Advanced Machine Learning Lecture 3a: The Origin of Variational Bayes Geoffrey Hinton.
Space-Filling DOEs Design of experiments (DOE) for noisy data tend to place points on the boundary of the domain. When the error in the surrogate is due.
Radial Basis Function Networks
Gaussian process modelling
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Engineering Statistics ENGR 592 Prepared by: Mariam El-Maghraby Date: 26/05/04 Design of Experiments Plackett-Burman Box-Behnken.
EQT373 STATISTIC FOR ENGINEERS Design of Experiment (DOE) Noorulnajwa Diyana Yaacob School of Bioprocess Engineering Universiti Malaysia Perlis 30 April.
Chapter 11Design & Analysis of Experiments 8E 2012 Montgomery 1.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Confidence intervals and hypothesis testing Petter Mostad
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
CHAPTER 17 O PTIMAL D ESIGN FOR E XPERIMENTAL I NPUTS Organization of chapter in ISSO –Background Motivation Finite sample and asymptotic (continuous)
Additional Topics in Prediction Methodology. Introduction Predictive distribution for random variable Y 0 is meant to capture all the information about.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Efficient Gaussian Process Regression for large Data Sets ANJISHNU BANERJEE, DAVID DUNSON, SURYA TOKDAR Biometrika, 2008.
Evolving RBF Networks via GP for Estimating Fitness Values using Surrogate Models Ahmed Kattan Edgar Galvan.
Gaussian Process and Prediction. (C) 2001 SNU CSE Artificial Intelligence Lab (SCAI)2 Outline Gaussian Process and Bayesian Regression  Bayesian regression.
Javad Azimi, Ali Jalali, Xiaoli Fern Oregon State University University of Texas at Austin In NIPS 2011, Workshop in Bayesian optimization, experimental.
Computacion Inteligente Least-Square Methods for System Identification.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Introduction to emulators Tony O’Hagan University of Sheffield.
8 Sept 2006, DEMA2006Slide 1 An Introduction to Computer Experiments and their Design Problems Tony O’Hagan University of Sheffield.
Kriging - Introduction Method invented in the 1950s by South African geologist Daniel Krige (1919-) for predicting distribution of minerals. Became very.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Chapter 7. Classification and Prediction
LINEAR CLASSIFIERS The Problem: Consider a two class task with ω1, ω2.
Deep Feedforward Networks
Analysis of Definitive Screening Designs
Part 5 - Chapter
Probability Theory and Parameter Estimation I
Goal We present a hybrid optimization approach for solving global optimization problems, in particular automated parameter estimation models. The hybrid.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
DESIGN OF EXPERIMENTS.
Vibration in turbine blades must be prevented
CSE 4705 Artificial Intelligence
Linear Mixed Models in JMP Pro
CH 5: Multivariate Methods
Reliability based design optimization
Basic machine learning background with Python scikit-learn
Chapter 5 Introduction to Factorial Designs
Modern Design of Experiments for Rapid and Active Learning
Design and Analysis of Nonlinear Models for the Mars 2020 Rover
Lithography Diagnostics Based on Empirical Modeling
10701 / Machine Learning Today: - Cross validation,
Linear Model Selection and regularization
Pattern Recognition and Machine Learning
Ensemble learning Reminder - Bagging of Trees Random Forest
Introduction to Radial Basis Function Networks
Chapter 11 Variable Selection Procedures
Topological Signatures For Fast Mobility Analysis
Memory-Based Learning Instance-Based Learning K-Nearest Neighbor
Uncertainty Propagation
Presentation transcript:

A Primer on Running Deterministic Experiments Bradley Jones Principal Research Fellow JMP Division/SAS

Plan for this talk Introduce an internet-based simulator of a garden sprinkler. Investigate and compare several strategies for experimental design and analysis using the simulator.

Computer Experiments vs. Physical Experiments Fast Cheap Predictive of Real World? Low Accuracy Experiments LE Time to build prototypes Cost of prototypes Empirical Validity High Accuracy Experiments HE

Current Applications (partial list) Circuit Simulation (SPICE) Stress Analysis (FEA) Hurricane Tracking Weather Prediction Fluid Flow (FLUENT)

How are computer experiments different? No random error! So replication and randomization are useless. More complicated functions So low-order polynomial approximations may be inadequate. Statistical designs based on minimizing variance may not perform well based on their incorrect assumption.

Why design computer experiments? Individual runs may be expensive. (hours, days) They may depend on many variables. Need to develop a surrogate for the computer simulation model.

What do you want in a surrogate model? Adequate approximation to the computer simulation. Fast predictions for new points.

Standard Surrogate - Gaussian Process (GP) Pros of GP Models Flexibility Parsimonious in the number of unknowns No error for input settings where you have observed responses (i.e. GP models interpolate the data.)

Cons of GP models Estimating the GP parameters has technical challenges. Problems inverting near singular matrices Model fitting is slow compared to polynomial models. Interpolating the data does not mean perfect prediction for other factor combinations! Model is based on random functions but the modeled function is not random.

Alternative Surrogate Models Polynomial Regression Neural Nets Both using cross validation to avoid over-fitting.

Internet simulation tool https://perswww.kuleuven.be/~u0059569/doe/sprinkler/default.php

Sprinkler Simulation

Design Strategies for Comparison Maximin Latin Hypercube Design Covering Array Multi-level Orthogonal Design Fast Flexible Filling (FFF) Design

Maximin Latin Hypercube Design (Mm LHD) Construction of a random LHD with n runs. Each column is a random permutation of 1 to n Center and scale to match the desired low and high values Construction of an Mm LHD. Choose the column permutations so that the minimum distance between any two points in the design is maximized (i.e. spread the points apart).

Mm LHD with two factors and 11 runs

Properties of the Mm LHD Design points are equally spaced for every factor. No replication in projection. Design points are spread apart from each other more than for any other LHD

Covering Array New in JMP 12 Pro

Multi-level orthogonal design New in JMP 11 in the screening design tool

Fast Flexible Filling Designs New in JMP 11 in the space filling design tool Improved in JMP 12 with MaxPro optimization criterion

Fast Flexible Filling Design – MaxPro Criterion n is the number of runs and p is the number of factors Note that when xil– xjl gets small, m gets big. The goal of the design is to make m as small as possible.

FFF Design – Predicted Optimal Settings

Demonstrations using JMP Showing how to use JMP to create designs use the garden sprinkler simulator to generate responses use JMP to analyze the data and optimize operating settings