Vladimir Fonoberov (Aimdyn Inc, USA) Igor Mezic (UC Santa Barbara, USA)

Slides:



Advertisements
Similar presentations
Rachel T. Johnson Douglas C. Montgomery Bradley Jones
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Running a model's adjoint to obtain derivatives, while more efficient and accurate than other methods, such as the finite difference method, is a computationally.
Regularized risk minimization
Uncertainty Quantification & the PSUADE Software
Fast Algorithms For Hierarchical Range Histogram Constructions
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Dynamics of Learning VQ and Neural Gas Aree Witoelar, Michael Biehl Mathematics and Computing Science University of Groningen, Netherlands in collaboration.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Lecture Notes for CMPUT 466/551 Nilanjan Ray
Part 4 b Forward-Backward Algorithm & Viterbi Algorithm CSE717, SPRING 2008 CUBS, Univ at Buffalo.
TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.
Logistic Regression Rong Jin. Logistic Regression Model  In Gaussian generative model:  Generalize the ratio to a linear model Parameters: w and c.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Lecture 4 Measurement Accuracy and Statistical Variation.
A) Transformation method (for continuous distributions) U(0,1) : uniform distribution f(x) : arbitrary distribution f(x) dx = U(0,1)(u) du When inverse.
Curve fit noise=randn(1,30); x=1:1:30; y=x+noise ………………………………… [p,s]=polyfit(x,y,1);
Introduction to ModelingMonte Carlo Simulation Expensive Not always practical Time consuming Impossible for all situations Can be complex Cons Pros Experience.
1 Assessment of Imprecise Reliability Using Efficient Probabilistic Reanalysis Farizal Efstratios Nikolaidis SAE 2007 World Congress.
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
Efficient Model Selection for Support Vector Machines
 1  Outline  stages and topics in simulation  generation of random variates.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Part 5 Parameter Identification (Model Calibration/Updating)
Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology.
Least-Mean-Square Training of Cluster-Weighted-Modeling National Taiwan University Department of Computer Science and Information Engineering.
New data analysis for AURIGA Lucio Baggio Italy, INFN and University of Trento AURIGA.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Overview of Supervised Learning Overview of Supervised Learning2 Outline Linear Regression and Nearest Neighbors method Statistical Decision.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Museum and Institute of Zoology PAS Warsaw Magdalena Żytomska Berlin, 6th September 2007.
Ch 4. Linear Models for Classification (1/2) Pattern Recognition and Machine Learning, C. M. Bishop, Summarized and revised by Hee-Woong Lim.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
A new Ad Hoc Positioning System 컴퓨터 공학과 오영준.
MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:
1 Using Multiple Surrogates for Metamodeling Raphael T. Haftka (and Felipe A. C. Viana University of Florida.
Introduction to Digital Signals
0 - 1 © 2007 Texas Instruments Inc, Content developed in partnership with Tel-Aviv University From MATLAB ® and Simulink ® to Real Time with TI DSPs Spectrum.
Quality of model and Error Analysis in Variational Data Assimilation François-Xavier LE DIMET Victor SHUTYAEV Université Joseph Fourier+INRIA Projet IDOPT,
Cluster validation Integration ICES Bioinformatics.
DFT Applications Technology to calculate observables Global properties Spectroscopy DFT Solvers Functional form Functional optimization Estimation of theoretical.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Monte-Carlo based Expertise A powerful Tool for System Evaluation & Optimization  Introduction  Features  System Performance.
Gaussian Processes For Regression, Classification, and Prediction.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Machine Learning 5. Parametric Methods.
Stochastic Loss Reserving with the Collective Risk Model Glenn Meyers ISO Innovative Analytics Casualty Loss Reserving Seminar September 18, 2008.
Chance Constrained Robust Energy Efficiency in Cognitive Radio Networks with Channel Uncertainty Yongjun Xu and Xiaohui Zhao College of Communication Engineering,
1 Tom Edgar’s Contribution to Model Reduction as an introduction to Global Sensitivity Analysis Procedure Accounting for Effect of Available Experimental.
Tutorial 2, Part 2: Calibration of a damped oscillator.
Anders Nielsen Technical University of Denmark, DTU-Aqua Mark Maunder Inter-American Tropical Tuna Commission An Introduction.
Learning A Better Compiler Predicting Unroll Factors using Supervised Classification And Integrating CPU and L2 Cache Voltage Scaling using Machine Learning.
Introduction to emulators Tony O’Hagan University of Sheffield.
Introduction We consider the data of ~1800 phenotype measurements Each mouse has a given probability distribution of descending from one of 8 possible.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Uncertainty quantification in generic Monte Carlo Simulation: a mathematical framework How to do it? Abstract: Uncertainty Quantification (UQ) is the capability.
OPERATING SYSTEMS CS 3502 Fall 2017
Chapter 7. Classification and Prediction
Probability Theory and Parameter Estimation I
Monte Carlo Simulation Managing uncertainty in complex environments.
A Unifying View on Instance Selection
Igor V. Cadez, Padhraic Smyth, Geoff J. Mclachlan, Christine and E
Biointelligence Laboratory, Seoul National University
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
Optimization under Uncertainty
Presentation transcript:

Vladimir Fonoberov (Aimdyn Inc, USA) Igor Mezic (UC Santa Barbara, USA)

Computer models of physical and biological systems Matching model outputs to known data Global sensitivity GoSUM software Calibration of model of human cardiovascular system Conclusion 2

Complex models: dynamical systems 100s of parameters 10s of outputs 3

Commonly, the cost function to be minimized is L 2 norm of the difference between model outputs and real data For time-dependent outputs, both spectral and time domains can be matched, resulting in a better fit static modeldynamic model 4

5 Global measure of importance of model parameters for the outputs of interest NOT local sensitivity, e.g. partial derivatives around the nominal value Global sensitivity allows us to find parameters which are important over the entire range of interest of model parameters Very computationally intensive for very large models

6 Derivative-based, e.g. First-order sensitivity index Total effect sensitivity index

7 Direct evaluation of sensitivity integrals requires the model executable to be available and has complexity of N samp  Dimension model evaluations An analytical model representation can be created by “learning” the model (when dimension or model runtime is very large, this is the only way) We employ robust support vector regression (SVR) algorithms to learn the model The learning is based on maximizing predictive power of the analytical model representation

8 Analytical model representation where is k th input parameter sample When the number of parameters D is large and the number of samples N is comparable to or less than D, the analytical model representation may become over-smoothed This is because all parameters are considered equally important => model reduction is needed

9 Removing irrelevant parameters from the model allows us to Simplify the model Find more accurate analytical model representation and perform predictions Identify important parameters and speed-up uncertainty quantification, optimization, etc.

10

11 Each of the D input parameters has a prescribed distribution (uniform, Gaussian, exponential, discrete, etc.) Depending on available time, one estimates the number of times N the model executable can be run GoSUM generates exactly N (D-dimensional) points that accurately sample the joint probability distribution of input parameters using Latinized CVT (up to a few thousand samples) and DSample (any number of samples) 300 samples MC versus Latinized CVT 5,000 samples MC versus DSample

12 Supported model executable formats Windows executable MATLAB / Octave function Simulink model No executable (have model outputs) For every input sample, GoSUM evaluates the Model Executable in parallel (using a specified number of CPU cores) Model evaluations can be stopped by user

Model of a medium office building: 3 floors, 50,000 square feet, 15 zones 941 parameters, 10% uncertainty, uniform and exponential distributions 16 outputs: yearly energy consumption for different systems 5,000 samples Our recent publications: B. Eisenhower, Z. O’Neill, S. Narayanan, V. A. Fonoberov, and I. Mezic, "A Methodology for Meta-Model Based Optimization in Building Energy Models," Energy and Buildings 47, pp , 2012 B. Eisenhower, Z. O'Neill, V. A. Fonoberov, and I. Mezic, "Uncertainty and Sensitivity Decomposition of Building Energy Models," Journal of Building Performance Simulation 5, pp ,

14

15 GoSUM leans models using kernel-based support vector machine methods With regularization of noisy data SVR (L2) Epsilon-SVR (L1) Nu-SVR (L1) Without noisy data regularization RS (Response Surface) The learning is based on maximizing predictive power of analytical model representation

16 Global derivative sensitivity (L2) Global derivative sensitivity (L1) Average derivative Global variance sensitivity ANOVA decomposition based on analytical model representation

17

18

19 The problem is to minimize (or maximize) a function of model parameters and model outputs f(par 1, par 2, …, out 1, out 2, …) in a given range of parameters a i  par i  b i, i = 1, …, D and subject to arbitrary equality and/or inequality constrains 1000s of dimensions global maximum

Global stability option in GoSUM will perform global optimization of f(par 1, par 2, …, out 1, out 2, …, unc 1, unc 2, …) in a given range of parameters a i  par i  b i, i = 1, …, D subject to arbitrary equality and/or inequality constrains and in the presence of uncertain parameters unc 1, unc 2, … with known distributions 20 requirement level operation point out of specification 1000s of dimensions functional regime

21

22

23 When input samples are imported, arbitrary correlated distributions are automatically recognized Correlations are supported for all distributions types: continuous, discrete, categorical All correlations are regularized to reveal uncorrelated parameters Samples can be generated from arbitrary data-defined correlated distributions Optimization volume can be constrained by data boundaries

Respi- ration Vuev Vusv Vrv Vlv Emaxrv Emaxlv Heart Period xTsxTv B1, B2, 6P tasks 300 seconds each 24 about 20 states and 100 parameters

25

26

27 Define cost function to be minimized: Use GoSUM to analyze global sensitivity of cost function terms and identify the most important parameters Use GoSUM to minimize the cost function with respect to the most important parameters Use GoSUM to study sensitivity of the solution Repeat the steps above for all subjects of interest HPL - relative difference (error) between power spectral density (PSD) of the calculated and experimental heart periods for low frequencies; HPH - heart period relative PSD error for high frequencies; HPV - heart period relative PSD error for very low frequencies; HP - relative error in mean heart period; SAP - relative error in mean systolic blood pressure; DAP - relative error in mean diastolic blood pressure; SAPT - systolic arterial pressure relative PSD error; DAPT - diastolic arterial pressure relative PSD error

Statistics over 24 subjects 28

29 Model personalization is very important

30 A method for fast global sensitivity analysis of arbitrary high- dimensional models is developed A complex model of human cardiovascular system is studied and calibrated for individual subjects For the considered model, global-sensitivity based calibration required optimization over only 25% of model parameters All algorithms are implemented in our software GoSUM: software documentation and online tutorials are available at We would like to thank our sponsors and customers: Hamilton Sundstrand, Ford, Boeing, AFOSR, NIH, and DARPA