1 Using Multiple Surrogates for Metamodeling Raphael T. Haftka (and Felipe A. C. Viana University of Florida.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION GROUP
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
Pattern Recognition and Machine Learning
Kriging.
Cost of surrogates In linear regression, the process of fitting involves solving a set of linear equations once. For moving least squares, we need to form.
Pattern Recognition and Machine Learning: Kernel Methods.
Cost of surrogates In linear regression, the process of fitting involves solving a set of linear equations once. For moving least squares, we need to.
Model generalization Test error Bias, variance and complexity
Curve fit metrics When we fit a curve to data we ask: –What is the error metric for the best fit? –What is more accurate, the data or the fit? This lecture.
Including Uncertainty Models for Surrogate-based Global Optimization
Classification and Prediction: Regression Via Gradient Descent Optimization Bamshad Mobasher DePaul University.
Surrogate model based design optimization
Linear Methods for Regression Dept. Computer Science & Engineering, Shanghai Jiao Tong University.
Evaluating Hypotheses
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Curve fit noise=randn(1,30); x=1:1:30; y=x+noise ………………………………… [p,s]=polyfit(x,y,1);
Classification and Prediction: Regression Analysis
Minimum Weight Wing Design for a Utility Type Aircraft MIDDLE EAST TECHNICAL UNIVERSITY AE 462 – Aerospace Structures Design DESIGN TEAM : Osman Erdem.
Space-Filling DOEs Design of experiments (DOE) for noisy data tend to place points on the boundary of the domain. When the error in the surrogate is due.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Power Generation from Renewable Energy Sources
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
PATTERN RECOGNITION AND MACHINE LEARNING
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Physics 114: Exam 2 Review Lectures 11-16
Optimization with surrogates Based on cycles. Each consists of sampling design points by simulations, fitting surrogates to simulations and then optimizing.
WB1440 Engineering Optimization – Concepts and Applications Engineering Optimization Concepts and Applications Fred van Keulen Matthijs Langelaar CLA H21.1.
1 6. Reliability computations Objectives Learn how to compute reliability of a component given the probability distributions on the stress,S, and the strength,
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Lecture 2 Basics of probability in statistical simulation and stochastic programming Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius,
Power Generation from Renewable Energy Sources Fall 2012 Instructor: Xiaodong Chu : Office Tel.:
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Sensitivity derivatives Can obtain sensitivity derivatives of structural response at several levels Finite difference sensitivity (section 7.1) Analytical.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Review of fundamental 1 Data mining in 1D: curve fitting by LLS Approximation-generalization tradeoff First homework assignment.
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
INCLUDING UNCERTAINTY MODELS FOR SURROGATE BASED GLOBAL DESIGN OPTIMIZATION The EGO algorithm STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION GROUP Thanks.
Derivative-Enhanced Variable Fidelity Kriging Approach Dept. of Mechanical Engineering, University of Wyoming, USA Wataru YAMAZAKI 23 rd, September, 2010.
A. Brown MSFC/ED21 Using Plate Elements for Modeling Fillets in Design, Optimization, and Dynamic Analysis FEMCI Workshop, May 2003 Dr. Andrew M. Brown.
Optimization formulation Optimization methods help us find solutions to problems where we seek to find the best of something. This lecture is about how.
1 CHAP 8 STRUCTURAL DESIGN USING FINITE ELEMENTS FINITE ELEMENT ANALYSIS AND DESIGN Nam-Ho Kim Edited and audio Raphael Haftka.
1 Design of experiment for computer simulations Let X = (X 1,…,X p )  R p denote the vector of input values chosen for the computer program Each X j is.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
Structural & Multidisciplinary Optimization Group Deciding How Conservative A Designer Should Be: Simulating Future Tests and Redesign Nathaniel Price.
The University of SydneySlide 1 Simulation Driven Biomedical Optimisation Andrian Sue AMME4981/9981 Week 5 Semester 1, 2016 Lecture 5.
Curve fit metrics When we fit a curve to data we ask: –What is the error metric for the best fit? –What is more accurate, the data or the fit? This lecture.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Kriging - Introduction Method invented in the 1950s by South African geologist Daniel Krige (1919-) for predicting distribution of minerals. Became very.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
Stats Methods at IC Lecture 3: Regression.
Regression and Correlation
Physics 114: Lecture 13 Probability Tests & Linear Fitting
Chapter 7. Classification and Prediction
Questions from lectures
March 9th, 2015 Maxime Lapalme Nazim Ould-Brahim
Actual Power Developed by A Rotor
Bias and Variance of the Estimator
CHAPTER 29: Multiple Regression*
Filtering and State Estimation: Basic Concepts
Curve fit metrics When we fit a curve to data we ask:
Curve fit metrics When we fit a curve to data we ask:
Presentation transcript:

1 Using Multiple Surrogates for Metamodeling Raphael T. Haftka (and Felipe A. C. Viana University of Florida

2 KEY QUESTIONS Is it possible to choose the best surrogate for a given problem? What are the advantages of using multiple surrogates in optimization? Surrogates are only approximations and as such they incur errors. This raises questions that will be discussed in this lecture:

3 WHY SO MANY SURROGATES? Different statistical models: -Response surface assumes known function and noise in data. -Kriging assumes exact data and random function. Different basis functions (polynomials, radial basis functions). Different loss functions (mostly SVR): -RMS is not sacred (L1 has some advantages). Popular surrogates: polynomial response surface. (PRS) kriging models (KRG), support vector regression (SVR). Each has multiple flavors.

4 WE USE ROOT MEAN SQUARE ERROR Root Mean Square Error ( RMSE ) in design domain with volume V: where is error in prediction of the surrogate model,. Compute RMSE by Monte-Carlo integration at large number of p test test points: Used only for assessing accuracy for test problems.

5 DOE FOR FIT AND DOE FOR INTEGRATION Example of design of experiments (DOE) used for fitting functions of 2 variables Two of the five highly dense DOEs used for RMSE estimation. RMSE is the average of the values obtained with the five DOEs.

6 HOW WE GENERATE A LARGE SET OF SURROGATES A set of 24 basic surrogates is generated by varying the model technique and the respective associated parameters.

7 NO BEST SURROGATE EVEN FOR GIVEN FUNCTION Branin-Hoo function (100 DOEs) 12 Points 20 Points For 11 test problems, 12 surrogates were the best at least 10 times. Every problem had at least 2 surrogates that worked the best at least 10 times.

8 TEST PROBLEMS Other analytical test functions

9 AIRFOIL APPLICATION AIRCRAFT TAKEOFF PERFORMANCE: Requires airfoil lift, drag and pitching moment coefficients. 11 design variables: 10 variables describing the airfoil and one for the angle of attack. 450 simulations: 156 points for fitting and 294 points for RMSE computation. Points selected randomly for 100 DOEs.

10 CROSS-VALIDATION ERRORS One data point is ignored and surrogate fitted to other p – 1 points. Repeat for each data point to obtain the vector of PRESS errors,. For large p, k-fold strategy used instead. Leaves k points out each time. With the PRESS vector, estimate RMSE as: We can now compare surrogates on the basis of their PRESS error.

11 CORRELATION IMPROVES WITH NUMBER OF POINTS CONCLUSION: With enough points (even sparse) can use PRESS RMS to choose a good surrogate. Mean value of the correlation between PRESS RMS and RMSE (out of 100 experiments).

12 FREQUENCY OF BestRMSE vs. BestPRESS For large number of points, the best 3 surrogates according to both RMSE (in blue) and PRESS RMS (in red) tend to be the same.

13 COMPUTATIONAL COST Wall (wait) time on an Intel Core2 T GHz, 2GB or RAM laptop, running MATLAB 7.0 under Windows XP.

14 PASSIVE HELICOPTER VIBRATION REDUCTION In helicopters, the dominant source of vibrations is the rotor (N b /rev). In the passive approach: –Objective function consists of a suitable combination of the N b /rev hub loads –Constraints: stability margin, frequency placement, autorotation, side constraints –Design variables: cross-sectional dimensions, mass and stiffness distributions along the span, pretwist, and geometrical parameters which define advanced geometry tips Aerodynamic environment is expensive to model Glaz, B, Goel, T, Liu, L, Haftka, RT, Friedmann. (2009)“Multiple-Surrogate Approach to Helicopter Rotor Blade Vibration Reduction” AIAA Journal,Vol 47(1), 271–282

15 OPTIMIZATION PROBLEM Objective function to be minimized: – –Weighted sum of the 4/rev oscillatory hub shear resultant and the 4/rev oscillatory hub moment resultant 17 Design Variables: t 1, t 2, t 3, and m ns –Three thickness defined at 0%, 25%, 50%, 75%, and 100% blade stations –Non-structural mass is defined at 68% and 100% stations

16 EXPENSIVE STRESS CONSTRAINT Assuming isotropy, Von Mises’ criterion is used to determine if the blade yields, with a factor of safety. – –Constraint is enforced at a set of discrete points. Calculation of blade stresses is as expensive as a vibration objective function evaluation since a forward flight simulation is needed. A surrogate used for this constraint.

17 Weighted (PWS) Surrogate Construction Optimal Latin hypercubes (OLH) used to create surrogates –Out of a 300 pt. OLH, 283 had converged trim solutions (53 hours) –Out of a 500 pt. OLH, 484 had converged trim solutions (82 hours) Each simulation took 8 hours, with run in parallel Fitting plus PRESS took 7-10 minutes for 283 points, minutes for 484 Weights in table inversely proportional to PRESS error. Weight Coefficient Sample Size F 4X F 4Y F 4Z M 4X M 4Y M 4Z JStress Constraint w poly w krg w RBNN w poly w krg w RBNN

18 Errors at 197 test points 283: Kriging has lowest error 484: Polynomials are lowest in some instances 283 Sample Points484 Sample Points Average Errors

19 Optimization Results Among individual approximation methods: –Poly. result in the best design with 283 sample points, and the worst design with 484 sample points –RBNN lead to the best design with 484 sample points Each optimization required 2-4 hours with about 200,000 function evaluations. None of the surrogates led to the same design SurrogateSample Size Vibration Reduction Poly.28364% KRG28354% RBNN28357% Poly.48445% KRG48456% RBNN48468% Vibration reduction (relative to MBB BO-105)

20 CONCLUDING REMARKS The most accurate surrogate for a given function depends on the design of experiments and point density. The cross validation error identifies accurate surrogates well, especially as the number of points in the DOE increases. Cost of fitting multiple surrogates and calculating cross- validation errors low enough to use now for most expensive simulation problems. Optimizing with several surrogates adds little to overall cost, and the best design may be obtained by a less accurate surrogate.