1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.

Slides:



Advertisements
Similar presentations
Hypothesis testing and confidence intervals by resampling by J. Kárász.
Advertisements

Mean, Proportion, CLT Bootstrap
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sensitivity Analysis In deterministic analysis, single fixed values (typically, mean values) of representative samples or strength parameters or slope.
Sampling: Final and Initial Sample Size Determination
Statistics review of basic probability and statistics.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Lecture 3 Probability and Measurement Error, Part 2.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
The Simple Regression Model
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Sampling Distributions
Lecture 8 The Principle of Maximum Likelihood. Syllabus Lecture 01Describing Inverse Problems Lecture 02Probability and Measurement Error, Part 1 Lecture.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
CHAPTER 6 Statistical Analysis of Experimental Data
Bootstrapping LING 572 Fei Xia 1/31/06.
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Continuous Random Variables and Probability Distributions
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Lecture II-2: Probability Review
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Introduction to Monte Carlo Methods D.J.C. Mackay.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Estimation Basic Concepts & Estimation of Proportions
Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.
Estimation of Statistical Parameters
Random Sampling, Point Estimation and Maximum Likelihood.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Week 41 Estimation – Posterior mean An alternative estimate to the posterior mode is the posterior mean. It is given by E(θ | s), whenever it exists. This.
Week 21 Stochastic Process - Introduction Stochastic processes are processes that proceed randomly in time. Rather than consider fixed random variables.
Lecture 12: Linkage Analysis V Date: 10/03/02  Least squares  An EM algorithm  Simulated distribution  Marker coverage and density.
Robust Estimators.
Computer Vision Lecture 6. Probabilistic Methods in Segmentation.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
1 / 12 Michael Beer, Vladik Kreinovich COMPARING INTERVALS AND MOMENTS FOR THE QUANTIFICATION OF COARSE INFORMATION M. Beer University of Liverpool V.
STATISTICS AND OPTIMIZATION Dr. Asawer A. Alwasiti.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Analysis of Experimental Data; Introduction
BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.
Continuous Random Variables and Probability Distributions
1 EE571 PART 3 Random Processes Huseyin Bilgekul Eeng571 Probability and astochastic Processes Department of Electrical and Electronic Engineering Eastern.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
CHAPTER- 3.1 ERROR ANALYSIS.  Now we shall further consider  how to estimate uncertainties in our measurements,  the sources of the uncertainties,
1 Ka-fu Wong University of Hong Kong A Brief Review of Probability, Statistics, and Regression for Forecasting.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Oliver Schulte Machine Learning 726
ESTIMATION.
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
More about Posterior Distributions
STOCHASTIC HYDROLOGY Random Processes
Parametric Methods Berlin Chen, 2005 References:
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
DESIGN OF EXPERIMENT (DOE)
Bootstrapping and Bootstrapping Regression Models
Introductory Statistics
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence in the Range of Variability

2 Problem definition  Suppose, only few measured values of an uncertain quantity are available: Is it under such circumstance possible to establish a credible probability distribution for the reliability assessment? - Without any strong assumptions, certainly not ! - There is an infinite number of options ! - Some physical background information is needed to proceed any further.

3 Problem definition (cont.)  Among the infinite set of options, the choice should reflect the needs of the analysis  The uncertainty due to the insufficient amount of data points should be considered  For estimating the performance (safety assessment) confidence in the estimates will be crucial. Some options:

4 Problem definition (cont.)  Only few measured values of an uncertain quantity are available: Is it under such circumstance possible to establish a credible probability distribution for the reliability assessment? Yes if: - Some physical background information can be safely assumed - We are not looking for the best estimate (e.g. a Bayesian approach) but for an conservative PDF estimate for a required confidence level.

5 Overview  Bootstrap procedure  Statistical results, e.g. confidence intervals  Probability of observation  Probability of lying outside the observed domain  Probability density estimation  Extended bootstrap procédure  Marginal distributions for calibration data  Joint distributions  Results  Conclusions

6 Bootstrap procedure  Bootstrap procedure:  Modern, computer-intensive, general purpose approach to statistical inference.  Approach to compute properties of an estimator (e.g. variance, confidence intervals, correlations).  Advantage: Straightforward also for complex estimators and complex distributions.  Disadvantage: Tendency to be too optimistic for small sample sizes.

7 Bootstrap procedure  Bootstrap procedure (cont.):  Given the data set  Resampling: Generate artificially a large number N of sets from data by random sampling  Determine for each of the N sets the estimator (e.g. mean, variance, etc.), establish the histogram and derive confidence intervals: Estimator (e.g.variance)

8 Bootstrap procedure  Bootstrap procedure (cont.):  Resampling corresponds to sampling from the discrete probability distribution  The inference is only justified in case the sample represent the underlying unknown distribution well.  The method is not reliable if only very few data are availble, i.e. in case n is small.  The case n < 30 will be investigated in the following:

9 Probability mass outside the observation range  N > 1 data points specify the observed range  Define interval [a,b]  Assume independent data points  Suggestion : Interpret q N as level of significance confidence level = 1-

10 Probability mass outside the observation range  Probability

11 Probability density  Density outside the observed domain Until now we just have an estimate for the probability, not the density! Almost everthing is possible without any physical background information  Reasonable (physical) assumptions:  The density is high in the neighbourhood of any observations  The density decreases with its distance from observations  The density has a single domain with PDF(x)>0

12 Proposed PDF  Extended bootstrap distribution:  Replace the underlying discrete bootstrap probability distribution  by continuous Gaussian kernel density functions

13 Proposed PDF  Kernel densities: N gaussian densitities centered at the data points Justification: + each data point has equal weight and provides identical information + each data point has the same variability + the probability of occurrence decreases with the distance  is used to specify the standard deviation

14 Application to data  Calibration experiments: Young's modulus elongation Three data sets Notation: inverse Young's modulus average inverse Young's modulus over the length

15 Application to calibration data  Inverse Young's modulus Exceptionally large dispersion for N c =5 when compared with N c =30. The distribution is function of the amount of data points N c and the required confidence level 1- a.

16 Application to calibration data  Average inverse Young's modulus Exceptionally small dispersion for N c =5 when compared with N c =30. The distribution is function of the amount of data points N c and the required confidence level 1- a.

17 Application to calibration data  Joint distribution as function of N c and significance level

18 Application to calibration data  Joint distribution as function of N c and significance level

19 Application to calibration data  Joint distribution as function of N c and significance level

20 Random field calibration  Random field model  simple piecewise linear  i.i.d inverse Young's moduli  distribution of (maximum entropy principle) derives from mechanics

21 Random field calibration  Fitting of The average correlation length D is selected such that it fits the joint distribution best.  Simple Monte Carlo search

22 Application: Static Challenge problem  Prediction of Exceedance probability  Young's modulus in all four bars modelled as random field  Challenge: Estimation of exceedance probability

23 Application: Static Challenge problem  Prediction of Exceedance probability  Consistent results  Severe underestimation without introducing a low level of

24 Summary and Conclusion The spread of the assumed probability distribution is a function of the number of data points and the required confidence level. The introduction of confidence level provides a suitable safeguard against a severe underestimation of the variability of the parameters derived from a small data set. Consistent results can be obtained although the small data set might be misleading.

25 Acknowledgment This research is partially supported by the European Commission under contract # RTN (MADUSE)