Simple Interval Calculation bi-linear modelling method. SIC-method Rodionova Oxana Semenov Institute of Chemical Physics RAS & Russian.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

3.3 Hypothesis Testing in Multiple Linear Regression
Kin 304 Regression Linear Regression Least Sum of Squares
The Simple Regression Model
Regression analysis Relating two data matrices/tables to each other Purpose: prediction and interpretation Y-data X-data.
Sampling: Final and Initial Sample Size Determination
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Correlation and Regression
1 Simple Interval Calculation (SIC-method) theory and applications. Rodionova Oxana Semenov Institute of Chemical Physics RAS & Russian.
1 Status Classification of MVC Objects Oxana Rodionova & Alexey Pomerantsev Semenov Institute of Chemical Physics Russian Chemometric Society Moscow.
Econ 140 Lecture 121 Prediction and Fit Lecture 12.
BA 555 Practical Business Analysis
Statistics: Data Analysis and Presentation Fr Clinic II.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
SVM for Regression DMML Lab 04/20/07. SVM Recall Two-class classification problem using linear model:
CHAPTER 3 Describing Relationships
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
Standard error of estimate & Confidence interval.
Correlation & Regression
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Introduction to Linear Regression and Correlation Analysis
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
Model Checking Using residuals to check the validity of the linear regression model assumptions.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Statistics for Business and Economics Dr. TANG Yu Department of Mathematics Soochow University May 28, 2007.
Descriptive Statistics: Variability Lesson 5. Theories & Statistical Models n Theories l Describe, explain, & predict real- world events/objects n Models.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Successive Bayesian Estimation Alexey Pomerantsev Semenov Institute of Chemical Physics Russian Chemometrics Society.
WSC-4 Simple View on Simple Interval Calculation (SIC) Alexey Pomerantsev, Oxana Rodionova Institute of Chemical Physics, Moscow and Kurt Varmuza.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
Subset Selection Problem Oxana Rodionova & Alexey Pomerantsev Semenov Institute of Chemical Physics Russian Chemometric Society Moscow.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 14 Inference for Regression © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Statistical Methods II: Confidence Intervals ChE 477 (UO Lab) Lecture 4 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Prediction of NMR Chemical Shifts. A Chemometrical Approach К.А. Blinov, Y.D. Smurnyy, Т.S. Churanova, М.Е. Elyashberg Advanced Chemistry Development (ACD)
Chimiometrie 2009 Proposed model for Challenge2009 Patrícia Valderrama
CHAPTER 1 EVERYTHING YOU EVER WANTED TO KNOW ABOUT STATISTCS.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
WSC-5 Hard and soft modeling. A case study Alexey Pomerantsev Institute of Chemical Physics, Moscow.
D/RS 1013 Discriminant Analysis. Discriminant Analysis Overview n multivariate extension of the one-way ANOVA n looks at differences between 2 or more.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
The simple linear regression model and parameter estimation
JMP Discovery Summit 2016 Janet Alvarado
Deep Feedforward Networks
Part 5 - Chapter
Correlation, Bivariate Regression, and Multiple Regression
CHAPTER 3 Describing Relationships
Data Mining: Concepts and Techniques
CH 5: Multivariate Methods
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Classification Discriminant Analysis
Statistical Methods For Engineers
Regression Computer Print Out
Simple Regression Mary M. Whiteside, PhD.
LESSON 24: INFERENCES USING REGRESSION
Regression Models - Introduction
LESSON 4.4. MULTIPLE LINEAR REGRESSION. Residual Analysis
BA 275 Quantitative Business Methods
Regression and Correlation of Data
Recognition of the 'high quality’ forgeries among medicines
Presentation transcript:

Simple Interval Calculation bi-linear modelling method. SIC-method Rodionova Oxana Semenov Institute of Chemical Physics RAS & Russian Chemometric Society

Stages of Multivariate Data Analysis Experimental design (DOE) 1.minimizing the total number of experiments 2. obtain as much “information” as possible. Validation Prediction accuracy of prediction ? Modelling Maximally informative model

Simple Interval Calculation (SIC) gives the result of the prediction directly in an interval form Interval calculationSimple 1.simple idea lies in the background 2. well-known mathematical methods are used for its implementation.

Main Assumption of SIC-method All errors are limited. Normal (  ) distribution Finite (  ) distributions

The Region of Possible Values (RPV)

The RPV A Properties An example of RPV (heptagon) with vertexes 1, 2,..7

SIC Prediction V-prediction interval U-test interval

What Can Go Wrong? “True” values lie outside of the prediction intervals Prediction intervals are far less than test intervals Very large prediction intervals

Quality of Prediction (Half) WIDTH of Prediction Interval INCLUDE - whether a reference value lies in Prediction Interval SEPI - Standard Error of Interval Prediction OVERLAP a fraction of Test interval, within Prediction interval.

Mean Values

Unknown . How to Find It?

Octane Rating Example X-predictors are NIR-measurements (absorbance spectra) over 226 wavelengths, Y –response is reference measurements of octane number. Training set =26 samples Test set =13 samples Spectral dada

Octane Rating Example

Real-world example Total number of samples (n) =15 Number of variable (p) =5 Calibration set =11 samples Testing set=4 samples Prediction of antioxidant activity using DSC measurements

SIC Object Status Theory

Boundary Sample RPV and its boundary samples “Prediction” of the calibration set

Insiders, Outsiders, Outliers

insiders, boundary samples, prediction intervals regression 90% conf. interval ‘true’ model y=xa regression line

The region of absolute outsiders Boundary samples (from calibration set) Calibration samples Test samples The border of absolute outsiders

The Sample Status in the Response Space

SIC– leverage / SIC–residual SIC– leverage MED-normalized SIC–residual SIC–residual Leverage – a measure of how far a data point to the majority Residual – a measure of the variation that is not taken into account by the model

SIC Object Status Map

The Main Features of the SIC-method SIC - METHOD gives the result of prediction directly in the interval form. calculates the prediction interval irrespective of sample position regarding the model. summarizes and processes all errors involved in bi- linear modelling all together and estimates the Maximum Error Deviation for the model provides wide possibilities for sample classification and outlier detection