Mordechai Shacham, Dept. of Chem

Slides:



Advertisements
Similar presentations
Chapter 7 Statistical Data Treatment and Evaluation
Advertisements

Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Read Chapter 17 of the textbook
© 2010 Pearson Prentice Hall. All rights reserved Regression Interval Estimates.
GG 313 Geological Data Analysis # 18 On Kilo Moana at sea October 25, 2005 Orthogonal Regression: Major axis and RMA Regression.
Curve-Fitting Regression
Development of Empirical Models From Process Data
Prediction and model selection
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Short Term Load Forecasting with Expert Fuzzy-Logic System
DATA QUALITY and ANALYSIS Strategy for Monitoring Post-fire Rehabilitation Treatments Troy Wirth and David Pyke USGS – Biological Resources Division Forest.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: prediction Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 Assessment of Imprecise Reliability Using Efficient Probabilistic Reanalysis Farizal Efstratios Nikolaidis SAE 2007 World Congress.
Least-Squares Regression
Multiple Linear and Polynomial Regression with Statistical Analysis Given a set of data of measured (or observed) values of a dependent variable: y i versus.
3 CHAPTER Cost Behavior 3-1.
Ch 8.1 Numerical Methods: The Euler or Tangent Line Method
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chem. 31 – 9/23 Lecture Guest Lecture Dr. Roy Dixon.
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
Analytical vs. Numerical Minimization Each experimental data point, l, has an error, ε l, associated with it ‣ Difference between the experimentally measured.
An Introduction to Programming and Algorithms. Course Objectives A basic understanding of engineering problem solving process. A basic understanding of.
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
MECN 3500 Inter - Bayamon Lecture 9 Numerical Methods for Engineering MECN 3500 Professor: Dr. Omar E. Meza Castillo
Predicting a Variety of Constant Pure Compound Properties by the Targeted QSPR Method Abstract The possibility of obtaining a reliable prediction a wide.
Curve-Fitting Regression
Considering Physical Property Uncertainties in Process Design Abstract A systematic procedure has been developed for process unit design based on the “worst.
Identifying Applicability Domains for Quantitative Structure Property Relationships Mordechai Shacham a, Neima Brauner b Georgi St. Cholakov c and Roumiana.
Solution of a Partial Differential Equations using the Method of Lines
Identification and Estimation of the Influential Parameters in Bioreaction Systems Mordechai Shacham Ben Gurion University of the Negev Beer-Sheva, Israel.
Evaluation of a Targeted-QSPR Based Pure Compound Property Prediction System Abstract The use of the DD – TQSPR (Dominant-Descriptor Targeted QSPR) method.
A "Reference Series" Method for Prediction of Properties of Long-Chain Substances Inga Paster and Mordechai Shacham Dept. Chem. Eng. Ben-Gurion University.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
A molecular descriptor database for homologous series of hydrocarbons ( n - alkanes, 1-alkenes and n-alkylbenzenes) and oxygen containing organic compounds.
Why The Bretz et al Examples Failed to Work In their discussion in the Biometrical Journal, Bretz et al. provide examples where the implementation of the.
Modular and Sequential Construction of Complex Process Models – Applications to Process Hazard Assessment Mordechai Shacham Dept. of Chem. Engng, Ben Gurion.
Sensitivity Analysis for the Purposes of Parameter Identification of a S. cerevisiae Fed-batch Cultivation Sensitivity Analysis for the Purposes of Parameter.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Fundamentals of Data Analysis Lecture 10 Correlation and regression.
I. Statistical Methods for Genome-Enabled Prediction of Complex Traits OUTLINE THE CHALLENGES OF PREDICTING COMPLEX TRAITS ORDINARY LEAST SQUARES (OLS)
Nonlinear Regression 1Michael Sokolov / Numerical Methods for Chemical Engineers / Nonlinear Regression Michael Sokolov ETH Zurich, Institut für Chemie-
Ondrej Ploc Part 2 The main methods of mathematical statistics, Probability distribution.
Stats Methods at IC Lecture 3: Regression.
Virtual University of Pakistan
Purposeful Model Parameters Genesis in Simple Genetic Algorithms
Physics 114: Lecture 13 Probability Tests & Linear Fitting
Problem Solving in Chemical Engineering with Numerical Methods
Regression Analysis Module 3.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Correlation and Simple Linear Regression
Ben-Gurion University
Chapter 4: Seasonal Series: Forecasting and Decomposition
Stochastic Hydrology Hydrological Frequency Analysis (II) LMRD-based GOF tests Prof. Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
Statistical Methods For Engineers
CONCEPTS OF ESTIMATION
Solution of Equations by Iteration
CHAPTER 14: Confidence Intervals The Basics
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Simon Lineykin and Sam Ben-Yaakov*
Romberg Rule of Integration
Romberg Rule of Integration
Threshold Autoregressive
Brian L. Sprague, Robert L. Pego, Diana A. Stavreva, James G. McNally 
Interval Estimation of mean response
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Realistic Assessment of Parameter Uncertainty in Dynamic Parameter Estimation Mordechai Shacham, Dept. of Chem. Engng, Ben Gurion University of the Negev, Beer-Sheva, Israel Neima Brauner, School of Engineering, Tel-Aviv University, Tel-Aviv, Israel Abstract Assessment of the uncertainty in parameter estimation is essential for confidence in subsequent use of the dynamic model with the associated parameters. The assessment of the parameter uncertainty in highly nonlinear kinetic models is often a very difficult task. In this paper a new method for parameter uncertainty assessment is presented and its use is demonstrated for a cellulose hydrolysis kinetic model. The new method involves generation of pseudo-experimental data using a known set of “reference” parameter values. Stepwise regression is used in an attempt to generate alternative sets of parameter values that yield results with precision similar to the reference set. The difference between the individual parameter values in the separate sets represents the uncertainty of these values. High uncertainty level indicates that no physical meaning can be attributed to the predicted parameter values. Application of the proposed method is therefore recommended prior to applying the individual parameter values in other models. Kinetic closed-system model of enzymatic saccharification of linocellulosic biomass1 Model Re-parametrization to Allow for Stepwise Regression Data Model Part of the model, which is essential for describing the parameter estimation process, is shown in the equation sets above. The model contains three state variables: S, G2 and G, for the substrate, cellobiose and glucose concentration, respectively and 12 kinetic parameters: k1r, K1IG2, K1IG, k2r, K2IG2, K2IG, k3r, K3M and K3IG. Using the experimental data provided by Kadam et al.1 , Sin et. al2 obtained estimates for the parameter values (including 95% confidence intervals). Observe that all the confidence intervals are much larger in absolute value than the parameter values. This indicates that for the suggested model (and the available data) the parameters are un-identifiable. Re-parametrization is carried out in order to prevent singularity of the model when a parameter value is set at zero. In r1 (original expression) there are divisions by two parameters: K1IG2 and K1IG. In order to prevent singularity when these parameters are set at zero they are replaced by: π1 = 1/ K1IG2 and π2 = 1/ K1IG. Similarly, the parameters in groups r2 and r3 are replaced as necessary, yielding: π3 = 1/ K2IG2, π4 = 1/ K2IG, π5 = k3r / K3M, π6 = 1/ K3IG and π7 = 1 / K3M. Inspection of (re-parametrized) group r1 reveals that setting the parameter k1r at zero nullifies the whole group. Thus, k1r cannot be set at zero (assigned H = 1 hierarchy), while the rest of the parameters in group r1 : π1 and π2 , are of H = 2. Similarly for groups r2 and r3, k2r and π5 are of H = 1, and π3, π4, π6 , π7 are of H = 2. Estimated parameters2 1Source: Kadam et. al, Biotechnol. Prog., 20, 698-705 (2004). 2Source: Sin et. al, Comp. Chem. Engng., 34, 1385-1392 (2010). Stepwise Regression for the Simulated Data Identification of different sets of parameter values that represent the data at similar precision levels Stepwise regression was carried out using the simulated data. All parameter values were bounded: 10-3 lower bound and 100 upper bound. The parameter values obtained by stepwise regression are significantly different from the “data verification” parameters, however, the sum of squared errors (Ф values) are very similar (7.16*10-5 vs. 4.23 *10-5 ). The proposed method for assessing parameter uncertainty involve the use of stepwise regression to identify different sets of parameter values that represent the available data at similar precision levels. For this aim simulated “experimental” data ( 150 data points) were generated using the re-parametrized model and the parameter values provided by Kadam et al1. To verify the agreement of the stored “experimental” data with the re-parametrized kinetic model and the “Reference” parameter values, parameter identification was carried out using the fmincon function of MATLAB yielding the following results. The π4 value is very close to the upper bound (set for the stepwise regression solution) and the value changes upon changing the upper bound. In spite of the large differences in the parameter values, the stepwise regression curves fit well the simulated experimental data. Observe the close agreement between the “reference” and “data verification” parameter values Sum of squared differences between simulated “experimental” data and calculated values 1Source: Kadam et. al, Biotechnol. Prog., 20, 698-705 (2004). Conclusions The results demonstrate that the proposed method enables identification of different sets of parameter values which can represent the available data at similar precision levels. The difference between the individual parameter values in the different sets represents the uncertainty of the parameter values. The main source of the parameters’ uncertainty is the interaction between the parameters. In such case no physical meaning can be attributed to the parameter values. Application of the proposed method is therefore recommended prior to borrowing the individual parameter values for application in other models.